Mailgun library to extract message quotations and signatures.
APACHE-2.0 License
OCR, layout analysis, reading order, line detection in 90+ languages
A curated list of applied machine learning and data science notebooks and libraries across differ...
Training data generator for text detection
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,B...
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model trainin...
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Unsupervised Language Modeling at scale for robust sentiment classification
pix2tex: Using a ViT to convert images of equations into LaTeX code.
The data scientist's open-source choice to scale, assess and maintain natural language data. Trea...
General Assembly's 2015 Data Science course in Washington, DC
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
Official Python SDK for Kern AI refinery.
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
📜 A python library for distributed training of a Transformer neural network across the Internet ...