pdf-ocr-overlay | Python Ecosystem Directory

🔎📝 A module for performing specialized OCR on food products and nutritional tables.

A set of tools for extracting tables from PDF files, helping to do data mining on (OCR-processed) ...

An easy way to extract information from documents

pdfrw is a pure Python library that reads and writes PDFs

ocr-docker is a small, Flask-powered web app that helps extract text from images and PDF document ...

Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help de...

Convert a PDF into page images

Easy-to-use text extractor for PDF, DOC, DOCX and other documents, including if necessary using...

Images of Text to Text: Call Tesseract from Python and OCR a directory of PDFs

A simple OCR preprocessing tool using Python with a GUI.

PoC: bulk search your PDF files using text lookup

Tools for running OCR against files stored in S3

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Search and replace text in PDF files with PyPDF.