a tool to quickly create sweet PDF files from text files
APACHE-2.0 License
Improved file parsing for LLM’s
A CLI toolset to generate table of contents for PDF files automatically.
Open Access PDF harvester
python package to calculate readability statistics of a text object - paragraphs, sentences, arti...
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of ...
Converse with book - Built with GPT-3
A Python tool for splitting large Markdown files into smaller sections based on a specified token...
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Prepare documents for distribution
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) ...
CLI & Python API to easily summarize text-based files with transformers
Implementation of Nougat Neural Optical Understanding for Academic Documents
pdfrw is a pure Python library that reads and writes PDFs
Convert PDF to markdown quickly with high accuracy
OCR-D wrapper for detectron2 based segmentation models