Convert PDF to markdown quickly with high accuracy
GPL-3.0 License
Improved file parsing for LLM’s
AI pretends to be paper/textbook author, you can ask it questions about the paper as a whole, spe...
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Implementation of Nougat Neural Optical Understanding for Academic Documents
Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 ...
Extract structured text from pdfs quickly
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
CLI & Python API to easily summarize text-based files with transformers
OCR-D wrapper for detectron2 based segmentation models
pix2tex: Using a ViT to convert images of equations into LaTeX code.
OCR, layout analysis, reading order, line detection in 90+ languages
Tools for running OCR against files stored in S3
a tool to quickly create sweet PDF files from text files