Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1562 33

Git Repositories

tika

Started

2007-03-31 (6,579 days ago)

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 (rank #2,262)

Weekly commits since inception (chart, 2007 to 2025)

Weekly contributors since inception (chart, 2007 to 2025)

Recent Project Activity

Period          Commits  Commit rank  Contributors  Contributor rank
Past 30 days    31       #1,176       2             #1,818
Past 90 days    88       #1,294       6             #1,675
Past 365 days   569      #1,077       22            #1,470
Past 1095 days  2,340    #996         45            #1,755
All time        8,045                 180

Contributing Individuals

Commits past X days
Rank  Contributor         30  90  365  1095  All
82    Ben Gilbert         0   0   0    1     1
82    tledoux             0   0   0    1     1
82    trevorlewis         0   0   0    0     3
82    Thorsten Heit       0   0   0    1     1
82    Luca Foppiano       0   0   0    1     1
82    lxb007981           0   0   0    1     1
107   Bruno P. Kinoshita  0   0   0    0     2
107   Kenneth Hoste       0   0   0    0     2
107   Joseph Naegele      0   0   0    0     2
107   Giuseppe Totaro     0   0   0    0     2
107   U-BASIS\dsmyda      0   0   0    0     2
107   AarjavP             0   0   0    0     2
107   cmenekse            0   0   0    0     2
107   NamithaGS           0   0   0    0     2
107   Ian Fricker         0   0   0    0     2
107   Joshua Hight        0   0   0    0     2
107   karanjeet-singh     0   0   0    0     2
107   smadha              0   0   0    0     2
107   phantuanminh        0   0   0    0     2
107   Sam Heijens         0   0   0    0     2
Showing 101 to 120 of 180 results
