OSS Project

Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank
1524
Decreased by 33
Git Repositories
tika
Started
2007-03-31 (6,505 days ago)
Categories
Productivity
Alternatives to
Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager
GitHub Stars
2,671 #2,271
[Chart: weekly commits since inception, 2007 to 2025]
[Chart: weekly contributors since inception, 2007 to 2025]
Recent Project Activity
Day Span   Commits          Contributors
30         23 (#1,126)      2 (#1,660)
90         107 (#1,094)     5 (#1,745)
365        682 (#957)       20 (#1,565)
1095       2,359 (#983)     44 (#1,788)
All time   7,977            177
Contributing Individuals
Commits past X days
#    Contributor                30    90    All
1    Tilman Hausherr            0     0     1,120
2    Tim Allison                2     24    1,895
3    Tilman Hausherr            19    76    450
4    Jukka Zitting              0     0     960
5    Nick Burch                 0     0     725
6    tballison                  0     0     695
7    Chris Mattmann             0     0     484
8    Gagravarr                  0     0     222
9    David Meikle               0     0     146
10   Tyler Palsulich            0     0     121
11   Mike McCandless            0     0     98
12   Nicholas DiPiazza          0     0     21
13   Konstantin Gribov          0     0     44
14   Maxim Valyanskiy           0     0     67
15   ThejanW                    0     0     66
16   dk2k                       0     0     11
17   lfcnassif                  0     0     23
18   Lewis John McGibbney       0     0     43
19   Kenneth William Krugler    0     0     35
20   Thamme Gowda               0     0     34
Contributing Companies
