s3-ocr | Python Ecosystem Directory

Commit Statistics

Past Year

All Time

Total Commits

Total Committers

Avg. Commits Per Committer

0.0

45.0

Bot Commits

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

N/A

about 1 month

Package Rankings

Top 20.01% on Pypi.org

Badges

Extracted from project README

Related Projects

ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc...

05 May 2017 19,808

image-to-text

Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs

29 Jan 2015 15

datatrove

Freeing data processing from scripting madness by providing a set of platform-agnostic customizab...

14 Jun 2023 1,761

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

20 Dec 2013 12,250

ocrd_detectron2

OCR-D wrapper for detectron2 based segmentation models

21 Jan 2022 16

img2table

img2table is a table identification and extraction Python Library for PDF and images, based on Op...

21 Mar 2022 527

omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compa...

04 Jun 2024 5,111

file-scraper

Scrape files for sensitive information, and generate an interactive HTML report. Based on Rabin2.

01 Apr 2023 8

bin-utils

Utility scripts / apps

10 May 2017 8

biblio_glutton_harvester

Open Access PDF harvester

29 Jul 2018 35

surya

OCR, layout analysis, reading order, line detection in 90+ languages

10 Jan 2024 6,739

pdftabextract

A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...

08 Jul 2016 2,209

label_generator

Training data generator for text detection

30 Apr 2015 39

s3tk

A security toolkit for Amazon S3

13 Sep 2017 451

Nkocr

🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.

07 Jul 2020 34