Apache Tika

A toolkit for detecting and extracting metadata and structured text content from various documents using existing parser libraries

Rank

1562 36

Git Repositories

tika

Started

2007-03-31 6,579 days ago

Categories

Alternatives to

Microsoft 365, OneNote, Evernote For Teams, Toggl Track, Roam Research, Adobe Experience Manager

GitHub Stars

2,754 #2,262

Weekly commits since inception

2007 2016 2025

Weekly contributors since inception

2007 2016 2025

Recent Project Activity

Past 30 days 31 commits #1,179 2 contributors #1,828
Past 90 days 88 commits #1,293 6 contributors #1,674
Past 365 days 569 commits #1,077 22 contributors #1,471
Past 1095 days 2,340 commits #996 45 contributors #1,755
All time 8,045 commits 180 contributors

Contributing Individuals

Commits past X days
30 90 365 1095 All
74 ThejanW 0 0 0 0 4
82 John Patrick 0 0 0 0 3
82 Aditya Dhulipala 0 0 0 0 3
82 Gagravarr 0 0 0 0 3
82 Hervé Boutemy 0 0 0 1 1
82 Matthew Caruana Galizia 0 0 0 0 3
82 Yakiv Yereskovskyi 0 0 0 1 1
82 Marcos Pereira 0 0 0 1 1
82 NissimShiman 0 0 0 1 1
82 Tom 0 0 0 1 1
82 thebsssss 0 0 0 1 1
82 Tayseer K. Sabha 0 0 0 1 1
82 Ann Bryant Burgess 0 0 0 0 3
82 ReEvApp - Re-Evolution Applications, LLC 0 0 0 0 3
82 amensiko 0 0 0 0 3
82 davidxie-glean 0 0 0 1 1
82 Manali Shah 0 0 0 0 3
82 nandan-pc 0 0 0 0 3
82 raviranjanjha 0 0 0 1 1
82 anantdahiya8 0 0 0 1 1
Showing 81 to 100 of 180 results Previous Next
Contributing Companies

Add this OSSRank shield to this project's README.md

[![OSSRank](https://shields.io/endpoint?url=https://ossrank.com/shield/2987)](https://ossrank.com/p/2987)