Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
torchao: PyTorch Architecture Optimization (AO). Performant kernels that work with PyTorch.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance.
Burn is a comprehensive dynamic Deep Learning Framework built in Rust, with extreme flexibility, compute efficiency and portability as its primary goals.
A JavaScript library like PyTorch, with GPU acceleration.
Run PyTorch LLMs locally on servers, desktop and mobile
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, for better performance with lower memory utilization in both training and inference.
Large Language Model Text Generation Inference
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
In this repository, I share some useful notes and references about deploying deep learning-based models in production.
An unofficial styleguide and best practices summary for PyTorch
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
This repository contains demos I made with the Transformers library by HuggingFace.
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, and it works with PyTorch.
Serve, optimize and scale PyTorch models in production
A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc.) on CPU and GPU.