Simple way to make scanned PDFs searchable
Statistics for this project are still being loaded, please check back later.
🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...
An easy way to extract information from documents
pdfrw is a pure Python library that reads and writes PDFs
ocr-docker is small, Flask powerd web app, helps us to extract text from images and pdf document ...
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help de...
Convert pdf to pages of images
Easy to use text extractor, from PDF, DOC, DOCX and other documents, including if necessary using...
Images of Text to Text: Call Tesseract from Python and OCR a directory of pdfs
A simple OCR preprocessing tool using Python with a GUI.
PoC bulk search you pdf files using text look up
Tools for running OCR against files stored in S3
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Search and replace text in PDF files with PyPDF.