A step-by-step C# implementation of the Docstrum algorithm
MIT License
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The ...
Document Layout Analysis resources repos for development with PdfPig.
Read and extract text and other content from PDFs in C# (port of PDFBox)
A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).
Using a MaskRCNN model trained on the PublayNet dataset with ML.Net in C# / .Net for Document lay...
Extract tables from PDF files (port of tabula-java)
Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is ...