Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries.

Rank

1456 105

Git Repositories

tika

Started

2007-03-31 (6,592 days ago)

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 (rank #2,262)

Weekly commits since inception (chart, 2007 to 2025)

Weekly contributors since inception (chart, 2007 to 2025)

Recent Project Activity

Period           Commits  Rank     Contributors  Rank
Past 30 days          36  #1,088              3  #1,561
Past 90 days          88  #1,273              7  #1,556
Past 365 days        554  #1,091             23  #1,418
Past 1095 days     2,347  #992               45  #1,754
All time           8,063                    180

Contributing Individuals

Commits past X days

Rank  Contributor        30  90  365  1095  All
20    Thamme Gowda        0   0    0     0   34
22    Dmitry Kriukov      0   0    5     5    5
22    subbudvk            0   3    3     3    3
22    Hong-Thai Nguyen    0   0    0     0   30
25    Lee                 0   0    0     0   29
26    Dave Meikle         0   0    0     1   26
27    Alexey Pelykh       0   0    4     4    4
27    manali              0   0    0     0   24
29    Konstantin Gribov   0   0    0     0   22
30    Kranthi Kiran GV    0   0    0     0   21
30    Ken Krugler         0   0    0     0   21
32    PeterAlfredLee      0   0    0     0   20
32    Rohan               0   0    0     0   20
34    PJ Fanning          1   1    1     2    3
34    nprate2             0   0    0     0   19
36    bitsgalore          0   0    0     0   18
37    ashankbehara        0   0    0     0   17
38    lsliwko             0   1    2     2    2
39    Madhav Sharan       0   0    0     0   15
39    Thamme Gowda        0   0    0     0   15
Showing 21 to 40 of 180 results

Contributing Companies
