📐 Compute distance between sequences. 30+ algorithms, pure Python implementation, common interface, optional use of external libraries.
MIT License
A Python library for calculating a large variety of metrics from text
Fuzzy matching and more functionality for spaCy.
Rapid fuzzy string matching in Python using various string metrics
Python package to calculate readability statistics of a text object - paragraphs, sentences, arti...
pytextclassifier is a toolkit for text classification: LR, Xgboost, TextCNN, FastText, TextRNN, B...
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
Distance metrics are one of the most important parts of some machine learning algorithms, supervi...
dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json...
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Jieba Chinese word segmentation
Text preprocessing, representation and visualization from zero to hero.
Automated Multi-Locus Sequence Analysis phylogenetic tree construction software
SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/lat...
Lightweight, super fast C/C++ (& Python) library for sequence alignment using edit (Levenshtein) ...
Yet another text augmentation Python package