Mailgun library to extract message quotations and signatures.
APACHE-2.0 License
Published by obukhov-sergey over 8 years ago
Published by r0mant about 9 years ago
The data scientist's open-source choice to scale, assess and maintain natural language data. Trea...
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model trainin...
OCR, layout analysis, reading order, line detection in 90+ languages
pix2tex: Using a ViT to convert images of equations into LaTeX code.
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
Official Python SDK for Kern AI refinery.
Unsupervised Language Modeling at scale for robust sentiment classification
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,B...
📜 A python library for distributed training of a Transformer neural network across the Internet ...
Training data generator for text detection
🗣️ Tool to generate adversarial text examples and test machine learning models against them
A curated list of applied machine learning and data science notebooks and libraries across differ...
General Assembly's 2015 Data Science course in Washington, DC