DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Apache-2.0 License

Downloads: 3.9K

Stars: 1.9K

Committers

Ecosystems: Playwright, PyTorch, Windows, Windows UI Library (WinUI), VS Code Extension, TypeScript

Commit Statistics (Past Year / All Time)

Total Commits: 224

Total Committers

Avg. Commits Per Committer: 4.41 / 6.22

Bot Commits

Issue Statistics (Past Year / All Time)

Total Pull Requests: 170

Merged Pull Requests: 133

Total Issues: 180 / 282

Time to Close Issues: 25 days / about 1 month

Package Rankings

Top 6.75% on Proxy.golang.org

Top 5.23% on Pypi.org

Related Projects

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

06 Aug 2021 716

BioGPT

15 Aug 2022 4,292

PromptCraft-Robotics

Community for applying LLMs to robotics and a robot simulator with ChatGPT integration

08 Feb 2023 1,852

mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

02 Feb 2023 235

ai-sentry

AI-Sentry: A lightweight, pluggable facade layer for Azure Open AI, addressing common cross-cutti...

03 Jun 2024 13

BlingFire

A lightning fast Finite State machine and REgular expression manipulation library.

13 Mar 2019 1,823

BiDR

Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable...

28 Feb 2022 15

MInference

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention,...

22 May 2024 725

MEGAVERSE

Official Codebase for MEGAVERSE: (published in ACL: NAACL 2024)

04 Jun 2024 6

evodiff

Generation of protein sequences and evolutionary alignments via discrete diffusion models

07 Jun 2022 487

aici

AICI: Prompts as (Wasm) Programs

26 Sep 2023 1,916

chiron

In Greek mythology, Chiron is a wise centaur known for his knowledge of medicine and healing.

09 Aug 2024 4

subseasonal_toolkit

Subseasonal forecasting models

27 Jul 2021 42

torchscale

Foundation Architecture for (M)LLMs

17 Nov 2022 3,006

mttl

Building modular LMs with parameter-efficient fine-tuning.

11 Jul 2022 76