Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various document formats using existing parser libraries.

Rank

1464 52

Git Repositories

tika

Started

2007-03-31 (6,587 days ago)

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

[Chart: weekly commits since inception, 2007 to 2025]

[Chart: weekly contributors since inception, 2007 to 2025]

Recent Project Activity

| Period | Commits | Commits rank | Contributors | Contributors rank |
| --- | --- | --- | --- | --- |
| Past 30 days | 35 | #1,111 | 3 | #1,570 |
| Past 90 days | 93 | #1,257 | 7 | #1,563 |
| Past 365 days | 552 | #1,092 | 23 | #1,425 |
| Past 1095 days | 2,340 | #996 | 45 | #1,755 |
| All time | 8,056 | | 180 | |

Contributing Individuals

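Commits over the past X days:

| Rank | Contributor | 30 | 90 | 365 | 1095 | All |
| --- | --- | --- | --- | --- | --- | --- |
| 107 | Thamme Gowda | 0 | 0 | 0 | 0 | 2 |
| 122 | Alexander Kazakov | 0 | 0 | 0 | 0 | 1 |
| 122 | Felix Sonntag | 0 | 0 | 0 | 0 | 1 |
| 122 | Cameron Rollheiser | 0 | 0 | 0 | 0 | 1 |
| 122 | Christian | 0 | 0 | 0 | 0 | 1 |
| 122 | Colm O hEigeartaigh | 0 | 0 | 0 | 0 | 1 |
| 122 | Gavin McDonald | 0 | 0 | 0 | 0 | 1 |
| 122 | Ioannis Kakavas | 0 | 0 | 0 | 0 | 1 |
| 122 | Julien Nioche | 0 | 0 | 0 | 0 | 1 |
| 122 | Karl-Philipp Richter | 0 | 0 | 0 | 0 | 1 |
| 122 | Marc Breslow | 0 | 0 | 0 | 0 | 1 |
| 122 | Olle Jonsson | 0 | 0 | 0 | 0 | 1 |
| 122 | Prasad Nagaraj Subramanya | 0 | 0 | 0 | 0 | 1 |
| 122 | Przemysław Sobala | 0 | 0 | 0 | 0 | 1 |
| 122 | Sean C. Sullivan | 0 | 0 | 0 | 0 | 1 |
| 122 | Graham | 0 | 0 | 0 | 0 | 1 |
| 122 | Matthieu Baechler | 0 | 0 | 0 | 0 | 1 |
| 122 | Hans Brende | 0 | 0 | 0 | 0 | 1 |
| 122 | Alexander Klimetschek | 0 | 0 | 0 | 0 | 1 |
| 122 | Hasan Kara | 0 | 0 | 0 | 0 | 1 |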
Showing 121 to 140 of 180 results
Contributing Companies
