Chain-of-Hindsight, A Scalable RLHF Method
APACHE-2.0 License
Statistics for this project are still being loaded, please check back later.
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently sup...
Hybrid Discriminative-Generative Training via Contrastive Learning
GLM (General Language Model)
minichatgpt - To Train ChatGPT In 5 Minutes
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse ra...
Unsupervised Language Modeling at scale for robust sentiment classification
A framework for few-shot evaluation of language models.
Code accompanying the paper Pretraining Language Models with Human Preferences
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
Home of StarCoder: fine-tuning & inference!
Embed arbitrary modalities (images, audio, documents, etc) into large language models.