NeatText a simple NLP package for cleaning textual data and text preprocessing
MIT License
Easily clean text with spaCy!
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
Text preprocessing, representation and visualization from zero to hero.
NLP预/后处理工具。
Template and Macros Expansion for Path names.
dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json...
a friendly yet powerful LR-parser written in Python
yet another text augmentation python package
A set of data files that can be used to train tesseract-ocr to read Georgian script (ქართული ენა)
A definitive guide to generating usernames for OSINT purposes
Commonly Consumed Code Commodities
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse ra...
A text processing tool including tag(HTML, URL, Email) extraction and removing, punctuation norma...