Aligning pretrained language models with instruction data generated by themselves.
APACHE-2.0 License
Implementation of "SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks"
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-...
[ACL 2024] An Easy-to-use Instruction Processing Framework for LLMs.
Ongoing research training transformer models at scale
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
HuatuoGPT, Towards Taming Language Models To Be a Doctor. (An Open Medical GPT)
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Code and documentation to train Stanford's Alpaca models, and generate the data.
prompt2model - Generate Deployable Models from Natural Language Instructions
structured outputs for llms
A school for camelids
GLM (General Language Model)
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...