LLaMA-Omni is a low-latency, high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
OpenChat: Advancing Open-source Language Models with Imperfect Data
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
WhisperPlus: Advancing Speech-to-Text Processing 🚀
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
⚡LLM Zoo is a project that provides data, models, and an evaluation benchmark for large language models.
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in audio, music, and speech generation research and development.
Dromedary: towards helpful, ethical and reliable LLMs.
VILA - a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops).
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
An Open-source Toolkit for LLM Development
Mixture-of-Experts for Large Vision-Language Models
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.