Benchmark Arabic text diacritization dataset
MIT License
Simple Solution for Multi-Criteria Chinese Word Segmentation
Turkish Natural Language Toolkit
Hebrew Diacritizer
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...
Translation-over-Diacritization technique implementation
Mishkal is an arabic text vocalization software
Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to i...
Properties of IPA symbols
Automatic categorization of documents, consists in assigning a category to a text based on the in...
A set of data files that can be used to train tesseract-ocr to read Georgian script (ქართული ენა)
Official implementation of: Tha3aroon at NSURL-2019 Task 8: Semantic Question Similarity in Arabic
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
Deep learning for AR text Vocalization - التشكيل الالي للنصوص العربية