A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训...
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of...
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
[ICLR 2024] Mol-Instructions: A Large-Scale Biomolecular Instruction Dataset for Large Language M...
ChatGPT爆火,开启了通往AGI的关键一步,本项目旨在汇总那些ChatGPT的开源平替们,包括文本大模型、多模态大模型等,为大家提供一些便利
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
A high-performance inference system for large language models, designed for production environments.
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
This repository contains a web application designed to execute relatively compact, locally-operat...
An Efficient "Factory" to Build Multiple LoRA Adapters
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet ...
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델