Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

started in 2007
8,132 commits
182 contributors

Rank

1356 213

Git Repositories

Started

2007-03-31 (6,654 days ago)

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 (rank #2,262)

Weekly commits since inception

[Chart: weekly commits, 2007 to 2025]

Weekly contributors since inception

[Chart: weekly contributors, 2007 to 2025]

Recent Project Activity

Time Period      Commits   Commits Rank   Contributors   Contributors Rank
Past 30 days     54        #788           4              #1,332
Past 90 days     102       #1,134         6              #1,567
Past 365 days    518       #1,098         24             #1,353
Past 1095 days   2,211     #1,017         45             #1,726
All time         8,132     -              182            -

Contributing Individuals

#    Contributor   30 days   90 days   365 days   1095 days   All time
1    -             17        30        135        551         1,935
2    -             0         0         41         1,042       1,120
3    -             34        67        293        505         554
4    -             0         0         0          0           960
5    -             0         0         0          0           725
6    -             0         0         0          0           695
7    -             0         0         0          3           484
8    -             0         0         0          8           222
9    -             0         0         1          5           146
10   -             0         0         0          0           121
11   -             0         0         0          0           98
12   -             2         2         8          16          23
13   -             0         0         0          0           67
14   ThejanW       0         0         0          0           66
15   -             0         0         0          10          44
16   dk2k          0         0         9          9           11
17   -             0         0         0          0           43
18   -             0         0         0          10          23
19   -             0         0         0          0           35
20   -             0         0         0          0           34
Showing 1 to 20 of 182 results

Contributing Companies

OSSRank Badge

Add this OSSRank shield to your project's README.md:
[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)