A CLI toolset to generate table of contents for PDF files automatically.
GPL-3.0 License
A simple previewer for various markup formats.
pdfrw is a pure Python library that reads and writes PDFs
Open Access PDF harvester
Extract structured text from pdfs quickly
a tool to quickly create sweet PDF files from text files
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...
Small python-gtk application, which helps the user to merge or split PDF documents and rotate, cr...
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of ...
Improved file parsing for LLM’s
Combine LaTeX docs into a single PDF
Python module to drive the awesome pdftk binary.
Transforms PDF, Documents and Images into Enriched Structured Data
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched