Small utility to prepare scanned documents. Supports separating PDF files by separator pages and removing blank pages.
MIT License
Transforms PDF, Documents and Images into Enriched Structured Data
Small python-gtk application, which helps the user to merge or split PDF documents and rotate, cr...
Toolkit for pdf editing.
Management script for LHAPDF files
A Python script that uses Python libraries, ImageJ, and ImageMagick to automatically convert a sc...
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of ...
Simple way to make scanned PDFs searchable
A simple OCR preprocessing tool using Python with a GUI.
Python module to drive the awesome pdftk binary.
pdfrw is a pure Python library that reads and writes PDFs
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...
Convert pdf to pages of images
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Parse vision is an open source tool to visualise what OCR is parsing in a PDF document to help de...