Home of StarCoder: fine-tuning & inference!
Code for the paper "Fine-tune BERT for Extractive Summarization"
4-bit quantization of LLaMA using GPTQ
Running large language models on a single GPU for throughput-oriented scenarios.
A simple, performant, and scalable JAX LLM!
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 ...
LLM inference benchmark
Home of StarCoder2!
Find better generation parameters for your LLM
Ongoing research training transformer language models at scale, including: BERT & GPT-2