Turkish Natural Language Toolkit
APACHE-2.0 License
A code snippet repository that provides examples of how to use different syntax parser generator ...
Mishkal is Arabic text vocalization software
A friendly yet powerful LR parser written in Python
A set of data files that can be used to train tesseract-ocr to read Georgian script (ქართული ენა)
Translation-over-Diacritization technique implementation
Text-Induced Corpus Clean-up
A Python module for English lemmatization and inflection.
Heteronym to Phoneme Parser
Code for the paper "Mind the Labels: Describing Relations in Knowledge Graphs With Pretrained Models"
A lightweight, extensible, high-level, universal code parser built on top of tree-sitter