SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models

This repository contains the codes of experiments of the paper SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models.

The rapid advancement in large language models (LLMs) comes with a significant increase in their parameter size, presenting challenges for adaptation and fine-tuning. Parameter-efficient fine-tuning (PEFT) methods are widely used to adapt LLMs for downstream tasks efficiently. In this paper, we propose Singular Values and Orthonormal Regularized Singular Vectors Adaptation, or SORSA, a novel PEFT method. Each SORSA adapter consists of two main parts: trainable principal singular weights $W_p = U_p \text{diag}(S_p) V^\top_p$, and frozen residual weights $W_r = U_r \text{diag}(S_r) V^\top_r$. These parts are initialized by performing SVD on pre-trained weights. Moreover, we implement and analyze an orthonormal regularizer. SORSA adapters could be merged during inference, thus eliminating any inference latency.

Empirical Experiments

Reproduce the Experiments

First, install sorsa package from pip:

pip install sorsa

Then, create .env file in the root directory of the project and add your Hugging Face Access Token:

hf=Your_Hugging_Face_Access_Token

Llama 2 7B, Mistral v0.1 7B and Gemma 7B

First, install the packages via anaconda

conda env create -f environment.yml

Run scripts from ./scripts/train_sorsa.sh to train the model.

After training, run the ./scripts/merge_sorsa.sh to merge the adapter to the base model:

Run following command to evaluate on GSM-8K:

python3 run.py --name llama2_sorsa_r128 \
  --test \
  --test-dataset gsm-8k \
  --test-precision bf16

Run following command to evaluate on MATH:

python3 run.py --name llama2_sorsa_r128 \
  --test \
  --test-dataset math \
  --test-precision bf16

Run following command to evaluate on HumanEval:

python3 run.py --name llama2_sorsa_r128 \
  --test \
  --test-dataset humaneval \
  --test-precision bf16

RWKV6

If you are training, merging or testing RWKV6 model, please add --rwkv flag to run.py.

Cite the work

You could cite the work by using the BibTeX Code in CITATION.bib.

Package Rankings

Top 33.95% on Pypi.org

Badges

Extracted from project README

Related Projects

LLamaTuner

Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Fal...

25 May 2023 568

EAGLE

EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

27 Jun 2024 509

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

20 Oct 2023 1,529

LLaVA-MORE

LLaVA-MORE: Enhancing Visual Instruction Tuning with LLaMA 3.1

31 Jul 2024 82

mLoRA

An Efficient "Factory" to Build Multiple LoRA Adapters

24 Aug 2023 204

llms-resist-alignment

Repo for paper "Language Models Resist Alignment"

09 Jun 2024 3

Chinese-Vicuna

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案，结构参考alpaca

23 Mar 2023 4,144

xllm

🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

10 Nov 2023 375

ViP-LLaVA

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

02 Dec 2023 278

LLM-Finetuning

LLM Finetuning with peft

08 Jun 2023 2,100

Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

21 Apr 2024 545

Multimodal-GPT

26 Apr 2023 1,467

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

03 Jun 2024 1,227

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

16 Oct 2023 545

KoAlpaca

KoAlpaca: 한국어 명령어를 이해하는 오픈소스 언어모델

18 Mar 2023 1,460