Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
Apache-2.0 License
Unsupervised Language Modeling at scale for robust sentiment classification
GLM (General Language Model)
Ongoing research training transformer models at scale
Keras implementation of transformers for humans
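A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with all code open-sourced for the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, RLHF optimization, and more. Supports downstream task sf...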
20+ high-performance LLM implementations with recipes to pretrain, finetune and deploy at scale.
Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer ...
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Home of StarCoder: fine-tuning & inference!
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating poin...
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
Implementation of UNet using TensorFlow Lite. Semantic segmentation without using a GPU with Raspberry...
An open platform for training, serving, and evaluating large language models. Release repo for Vi...