Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1597 92

Git Repositories

tika

Started

2007-03-31 6,576 days ago

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Past 30 days 19 commits #1,441 2 contributors #1,823
Past 90 days 76 commits #1,373 6 contributors #1,672
Past 365 days 560 commits #1,084 22 contributors #1,470
Past 1095 days 2,328 commits #995 45 contributors #1,755
All time 8,033 commits 180 contributors

Contributing Individuals

Commits past X days
30 90 365 1095 All
38 Zarana Parekh 0 0 0 0 15
38 asmehra95 0 0 0 0 15
43 Oleg Tikhonov 0 0 0 0 14
43 Keith Bennett 0 0 0 0 14
43 Sergey Beryozkin 0 0 0 0 14
46 bpaulin 0 0 0 0 13
47 ruwi-next 0 0 2 2 2
47 Jøger Hansegård 0 0 2 2 2
49 Bertrand Delacretaz 0 0 0 0 11
49 Sami Siren 0 0 0 0 11
51 Sebastian Nagel 0 0 1 1 5
51 luman 0 1 1 1 1
51 pleeplop 0 1 1 1 1
51 Peter Kronenberg 0 0 0 0 10
55 Rida Benjelloun 0 0 0 0 9
56 Tom Barber 0 0 0 0 8
56 Gérard Bouchar 0 0 0 0 8
56 Furkan KAMACI 0 0 0 0 8
59 Pascal Essiembre 0 0 0 0 7
60 Björn Kautler 0 0 1 1 1
Showing 41 to 60 of 180 results Previous Next
Contributing Companies

Add this OSSRank shield to this project's README.md

[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)