Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1591 76

Git Repositories

tika

Started

2007-03-31 6,574 days ago

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Past 30 days 19 commits #1,446 2 contributors #1,821
Past 90 days 77 commits #1,374 6 contributors #1,677
Past 365 days 560 commits #1,084 22 contributors #1,467
Past 1095 days 2,329 commits #995 45 contributors #1,755
All time 8,033 commits 180 contributors

Contributing Individuals

Commits past X days
30 90 365 1095 All
20 Ray Gauss II 0 0 0 0 34
22 Dmitry Kriukov 0 0 5 5 5
22 Hong-Thai Nguyen 0 0 0 0 30
22 subbudvk 0 3 3 3 3
25 Lee 0 0 0 0 29
26 Dave Meikle 0 0 0 1 26
27 Alexey Pelykh 0 0 4 4 4
27 manali 0 0 0 0 24
29 Konstantin Gribov 0 0 0 0 22
30 Ken Krugler 0 0 0 0 21
30 Kranthi Kiran GV 0 0 0 0 21
32 PeterAlfredLee 0 0 0 0 20
32 Rohan 0 0 0 0 20
34 nprate2 0 0 0 0 19
35 bitsgalore 0 0 0 0 18
36 ashankbehara 0 0 0 0 17
37 lsliwko 0 1 2 2 2
38 Madhav Sharan 0 0 0 0 15
38 Thamme Gowda 0 0 0 0 15
38 asmehra95 0 0 0 0 15
Showing 21 to 40 of 180 results Previous Next
Contributing Companies

Add this OSSRank shield to this project's README.md

[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)