build_MiniLLM_from_scratch

Build a MiniLLM from 0 to 1 (pretrain + SFT + DPO, work in progress)

MIT License

Stars

308

Committers

Ecosystems: Llama

Commit Statistics

Total Commits

113

Total Committers

Avg. Commits Per Committer

22.6

Bot Commits

Issue Statistics

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

about 1 month

Related Projects

MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains a medical large language model, with implementations including incremental pretrain...

02 Jun 2023 2,446

Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixt...

02 Apr 2023 5,702

textgen

TextGen: Implementation of Text Generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet ...

07 Apr 2021 926

llama3-Chinese-chat

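Chinese-language repository for Llama3 and Llama3.1 (written alongside a book in progress... interesting fine-tuned and modified weights from community members and vendors, plus tutorial videos and docs on training, inference, evaluation, and deployment)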

18 Apr 2024 3,967

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

15 Mar 2023 18,249

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

31 Aug 2023 4,079

Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model. A low-resource Chinese llama+lora approach, with a structure modeled on alpaca

23 Mar 2023 4,144

TigerBot

TigerBot: A multi-language multi-task LLM

12 May 2023 2,233

FindTheChatGPTer

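ChatGPT's explosive popularity marked a key step toward AGI. This project collects open-source alternatives to ChatGPT, including large text models and multimodal models, as a convenience for the community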

07 Apr 2023 2,014

Chinese-LLaMA-Alpaca-3

Phase three of the Chinese Alpaca LLM project (Chinese Llama-3 LLMs), developed from Meta Llama 3

09 Nov 2023 878

Baichuan-7B

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

14 Jun 2023 5,670

Chinese-Llama-2-7b

The open-source community's first downloadable, runnable Chinese LLaMA2 model!

20 Jul 2023 2,225

Llama-Chinese

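Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning materials are collected in real time, and all code has been updated for Llama3. Aims to build the best Chinese Llama LLM, fully open source and available for commercial use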

19 Jul 2023 12,154

Chinese-LLaMA-Alpaca-2

Phase two of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context m...

18 Jul 2023 7,068

Awesome-Chinese-LLM

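A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, domain-specific fine-tunes and applications, datasets, and tutorials.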

22 May 2023 13,245