Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1447 116

Git Repositories

tika

Started

2007-03-31 6,591 days ago

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Past 30 days 36 commits #1,086 3 contributors #1,551
Past 90 days 88 commits #1,268 7 contributors #1,551
Past 365 days 553 commits #1,091 23 contributors #1,422
Past 1095 days 2,346 commits #991 45 contributors #1,753
All time 8,062 commits 180 contributors

Contributing Individuals

Commits past X days
30 90 365 1095 All
1 Tim Allison 5 15 126 649 1,910
2 Tilman Hausherr 0 0 114 1,066 1,120
3 Tilman Hausherr 30 66 263 513 513
4 Jukka Zitting 0 0 0 0 960
5 Nick Burch 0 0 0 0 725
6 tballison 0 0 0 0 695
7 Chris Mattmann 0 0 0 3 484
8 Gagravarr 0 0 0 11 222
9 David Meikle 0 0 1 5 146
10 Tyler Palsulich 0 0 0 0 121
11 Mike McCandless 0 0 0 0 98
12 Nicholas DiPiazza 0 0 10 14 21
13 Maxim Valyanskiy 0 0 0 0 67
14 ThejanW 0 0 0 0 66
15 Konstantin Gribov 0 0 0 10 44
16 dk2k 0 0 9 9 11
17 lfcnassif 0 0 0 14 23
18 Lewis John McGibbney 0 0 0 0 43
19 Kenneth William Krugler 0 0 0 0 35
20 Thamme Gowda 0 0 0 0 34
Showing 1 to 20 of 180 results Previous Next
Contributing Companies

Add this OSSRank shield to this project's README.md

[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)