Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

started in 2007
8,067 commits
180 contributors

Rank

1514 105

Git Repositories

Started

2007-03-31 6,604 days ago

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Time Period Commits Contributors
Past 30 days 34 commits #1,091 3 contributors #1,537
Past 90 days 78 commits #1,359 5 contributors #1,802
Past 365 days 542 commits #1,098 22 contributors #1,447
Past 1095 days 2,309 commits #1,000 45 contributors #1,745
All time 8,067 commits 180 contributors

Contributing Individuals

Contributor 30 days 90 days 365 days 1095 days All time
0 0 0 1 1
81 tledoux
0 0 0 1 1
0 0 0 0 3
0 0 0 1 1
0 0 0 1 1
0 0 0 1 1
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
107 AarjavP
0 0 0 0 2
107 cmenekse
0 0 0 0 2
107 NamithaGS
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
107 smadha
0 0 0 0 2
0 0 0 0 2
0 0 0 0 2
Showing 101 to 120 of 180 results

Contributing Companies

Add the OSSRank badge to this project

OSSRank Badge Add this OSSRank shield to your project's README.md
[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)