Transformer seq2seq model: a program that builds a language translator from a parallel corpus
Apache-2.0 License
GPT implementation in Flax
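A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, RLHF optimization, and more. Supports downstream task sf...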
Multi-layer Recurrent Neural Networks (LSTM, RNN) for character-level language models in Python u...
A TensorFlow Implementation of the Transformer: Attention Is All You Need
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Code for the paper "Fine-tune BERT for Extractive Summarization"
Home of StarCoder2!
Unsupervised Language Modeling at scale for robust sentiment classification
A Korean sentence spacing model (deleting/adding spaces). Written so that you can train it yourself after preparing the data.
Build English-Vietnamese machine translation with ProtonX Transformer. :D
Transformer-based Text Auto-encoder (T-TA) using TensorFlow 2.
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Home of StarCoder: fine-tuning & inference!
TensorFlow Models for the Stanford Question Answering Dataset
Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE