Efficient AI Inference & Serving
A recipe for online RLHF and online iterative DPO.
Easy and efficient fine-tuning of LLMs. (Supports LLaMA, LLaMA2, LLaMA3, Qwen, Baichuan, GLM, Fal...
AirLLM: 70B inference with a single 4GB GPU
A high-performance inference system for large language models, designed for production environments.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and b...
LLaVA-NeXT-Image-Llama3-Lora, Modified from https://github.com/arielnlee/LLaVA-1.6-ft
Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).
TextGen: Implementation of text generation models, including LLaMA, BLOOM, GPT2, BART, T5, SongNet ...
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable fo...
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qw...
EAGLE: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
A high-throughput and memory-efficient inference and serving engine for LLMs
KoAlpaca: an open-source language model that understands Korean instructions
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs