img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing
MIT License
OCR-D wrapper for arbitrary coords-preserving image operations
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
OCR, layout analysis, reading order, line detection in 90+ languages
A Python wrapper for Google Tesseract
OCR-D wrapper for DoxaPy image binarization via locally adaptive thresholding
🔎📝 This is a module to make specifics OCRs at food products and nutritional tables.
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help de...
OCR-D wrapper for detectron2 based segmentation models
A Python wrapper for the tesseract-ocr API
Tools for running OCR against files stored in S3
Improved file parsing for LLM’s
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...
Automatic Number (License) Plate Recognition using Tensorflow Object Detection API
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compa...
Extract structured data from PDF invoices