A text processing tool including tag(HTML, URL, Email) extraction and removing, punctuation normalization, simple segmentation, and so on.
MIT License
NeatText a simple NLP package for cleaning textual data and text preprocessing
Python to Regex. Regex to Python. The yRegex for humans.
yet another text augmentation python package
A powerful and useful hacker dictionary builder for a brute-force attack
A definitive guide to generating usernames for OSINT purposes
UNIX command-line tool for python line-based stream processing
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
English word segmentation, written in pure-Python, and based on a trillion-word corpus.
Screens legal text and extracts sentences containing user input party name-predicate phrases
a friendly yet powerful LR-parser written in Python
NLP预/后处理工具。