PdfPigSvmRegionClassifier

Proof of concept of a simple SVM region classifier using PdfPig and Accord.NET. The objective is to classify each text block on a PDF document page as one of the following: title, text, list, table, or image.

MIT License

Ecosystems: C#

Related Projects

DocumentLayoutAnalysis

Document layout analysis resources and repositories for development with PdfPig.

02 Sep 2019 563

simple-docstrum

A step-by-step C# implementation of the Docstrum algorithm

23 Apr 2020 19

PdfPig

Read and extract text and other content from PDFs in C# (port of PDFBox)

09 Nov 2017 1,492

tabula-sharp

Extract tables from PDF files (port of tabula-java)

08 Sep 2020 136

PublayNet-maskrcnn-mlnet

Using a MaskRCNN model trained on the PubLayNet dataset with ML.NET in C# / .NET for Document lay...

17 Jul 2021 10

PdfPigMLNetBlockClassifier

Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The ...

15 Jan 2020 17