Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1564 45

Git Repositories

tika

Started

2007-03-31 6,577 days ago

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

[Chart: weekly commits since inception, 2007 to 2025]

[Chart: weekly contributors since inception, 2007 to 2025]

Recent Project Activity

Past 30 days: 30 commits (#1,207), 2 contributors (#1,831)
Past 90 days: 87 commits (#1,297), 6 contributors (#1,674)
Past 365 days: 570 commits (#1,077), 22 contributors (#1,472)
Past 1095 days: 2,339 commits (#995), 45 contributors (#1,755)
All time: 8,044 commits, 180 contributors

Contributing Individuals

Rank, contributor, then commits in the past 30, 90, 365, and 1095 days and all time:
60 George Gastaldi 0 0 0 2 2
60 Christian Schmidt 0 0 1 1 1
60 Eric Pugh 0 0 0 0 6
60 Björn Kautler 0 0 1 1 1
60 Antoni Mylka 0 0 0 0 6
60 Mingchun Zhao 0 0 1 1 1
60 Valery Yatsynovich 0 0 1 1 1
60 Dan Coldrick 0 0 0 2 2
60 Ewan Mellor 0 0 0 0 6
60 DHL 0 0 1 1 1
71 Sergey Beryozkin 0 0 0 0 5
71 Subhajit Das 0 0 0 0 5
71 Giuseppe Totaro 0 0 0 0 5
74 Javen O'Neal 0 0 0 0 4
74 Nassif 0 0 0 0 4
74 Kristen 0 0 0 0 4
74 Julien Nioche 0 0 0 0 4
74 Bob Paulin 0 0 0 0 4
74 Uwe Schindler 0 0 0 0 4
74 PJ Fanning 0 0 0 1 2
Showing 61 to 80 of 180 results

Add this OSSRank shield to this project's README.md

[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)