A number of command-line tools for working with FoLiA (Format for Linguistic Annotation). Includes validators, converters, visualisers, and more.
GPL-3.0 License
Statistics for this project are still being loaded, please check back later.
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost a...
FoLiA Document Server - HTTP webservice backend for serving and annotating FoLiA documents using ...
Namespace encoding hierarchical relationships between proteins, protein families, and protein com...
Dataframe Integration with spaCy.
Text-Induced Corpus Clean-up
Piereling is a webservice and web-application to convert between a variety of document formats, m...
Annotator combining different NLP pipelines.
Use spaCy for NLP and output to the FoLiA XML format.
A JupyterLab extension for papyri
A knowledge extraction tool that uses a large language model to extract semantic information from...
An extensive Python library for dealing with FoLiA (Format for Linguistic Annotation) documents, ...
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpi...
FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the rep...
🦆 Contextually-keyed word vectors