Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

started in 2007
8,086 commits
181 contributors

Rank

1540

Git Repositories

Started

2007-03-31 6,633 days ago

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Time Period Commits Contributors
Past 30 days 19 commits #1,339 3 contributors #1,518
Past 90 days 72 commits #1,361 4 contributors #1,887
Past 365 days 508 commits #1,124 23 contributors #1,408
Past 1095 days 2,208 commits #1,025 44 contributors #1,758
All time 8,086 commits 181 contributors

Contributing Individuals

Contributor 30 days 90 days 365 days 1095 days All time
12 18 125 560 1,922
0 0 73 1,042 1,120
6 52 263 494 524
0 0 0 0 960
0 0 0 0 725
0 0 0 0 695
0 0 0 3 484
0 0 0 10 222
0 0 1 5 146
0 0 0 0 121
0 0 0 0 98
0 0 7 14 21
0 0 0 0 67
14 ThejanW
0 0 0 0 66
0 0 0 10 44
16 dk2k
0 0 9 9 11
0 0 0 0 43
0 0 0 10 23
0 0 0 0 35
0 0 0 0 34
Showing 1 to 20 of 181 results

Contributing Companies

Add the OSSRank badge to this project

OSSRank Badge Add this OSSRank shield to your project's README.md
[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)