MIT License
Statistics for this project are still being loaded, please check back later.
Situated Interative Language Grounding Benchmark.
A modular RL library to fine-tune language models to human preferences
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
Clean PyTorch implementations of imitation and reward learning algorithms
Collection of reinforcement learning algorithms
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
Unsupervised Language Modeling at scale for robust sentiment classification
Official repository of Evolutionary Optimization of Model Merging Recipes
A framework for few-shot evaluation of language models.
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
SGLang is a structured generation language designed for large language models (LLMs). It makes yo...
GLM (General Language Model)
Dromedary: towards helpful, ethical and reliable LLMs.