Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines.
GPL-3.0 License
Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic...
A linter for prose.
An easy way to extract information from documents
Open Access PDF harvester, metadata aggregator and full-text ingester
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, ...
A simple RAG chatbot that can retrieve from a mediawiki data dump
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the rep...
GPT-powered chat for documentation, chat with your documents
Generic Environment for Context-Aware Correction of Orthography
Open Access PDF harvester
Text-Induced Corpus Clean-up
A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Include...
Question and Answer based on Anything.
Annotator combining different NLP pipelines.