tika

The Apache Tika toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).

Apache-2.0 License

Stars: 2.2K
Committers: 162

Commit Statistics

                              Past Year   All Time
Total Commits                 817         7,575
Total Committers              20          182
Avg. Commits Per Committer    40.85       41.62
Bot Commits                   438         1,140

Issue Statistics

                          Past Year   All Time
Total Pull Requests       369         975
Merged Pull Requests      338         904
Total Issues              0           0
Time to Close Issues      N/A         N/A