docstruct

Document structure detection from PAGE-XML to METS-XML

APACHE-2.0 License

Stars
6

docstruct

Document structure detection from PAGE to METS

Provides an OCR-D processor which will parse the input page-level structure (as detected by some OCR-D workflow including preprocessing, layout analysis and OCR) of a document annotated via PAGE-XML and METS-XML, further analyse it (...) and wrap it into a document-level structure in the METS using logical mets:structMap and either

for representation.