Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

started in 2007
8,190 commits
183 contributors

Rank

1383 80

Git Repositories

Started

2007-03-31 6,692 days ago

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,260

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Time Period Commits Contributors
Past 30 days 41 commits #933 3 contributors #1,519
Past 90 days 123 commits #997 7 contributors #1,459
Past 365 days 485 commits #1,128 23 contributors #1,386
Past 1095 days 2,177 commits #1,021 45 contributors #1,711
All time 8,190 commits 183 contributors

Contributing Individuals

Contributor 30 days 90 days 365 days 1095 days All time
10 36 121 541 1,946
0 1 13 1,025 1,121
30 81 308 507 599
0 0 0 0 960
0 0 0 0 725
0 0 0 0 695
0 0 0 3 484
0 0 0 5 222
0 0 1 4 146
0 0 0 0 121
0 0 0 0 98
0 2 4 15 23
0 0 0 0 67
14 ThejanW
0 0 0 0 66
0 0 0 10 44
16 dk2k
0 0 9 9 11
0 0 0 0 43
0 0 0 0 35
0 0 0 6 23
0 0 0 0 34
Showing 1 to 20 of 183 results

Contributing Companies

Add the OSSRank badge to this project

OSSRank Badge Add this OSSRank shield to your project's README.md
[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)