An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks
APACHE-2.0 License
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantiz...
Meditron is a suite of open-source medical Large Language Models (LLMs).
prompt2model - Generate Deployable Models from Natural Language Instructions
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities...
中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...
GLM (General Language Model)
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/...
Code for NAACL 2024 main conference paper "An Empirical Study of Consistency Regularization for E...
Ongoing research training transformer models at scale
Toolkit for creating, sharing and using natural language prompts.
Inference and training library for high-quality TTS models.