Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging
Chain-of-Hindsight, A Scalable RLHF Method
Geom3D: Geometric Modeling on 3D Structures, NeurIPS 2023
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse ra...
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
My Digital Palace - A Personal Journal for Reflection - A place to store all my thoughts
GLM (General Language Model)
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
Official release of InternLM2 7B and 20B base and chat models. 200K context support
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently sup...
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL