Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
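For flavor, a toy numpy sketch of the strided attention pattern described in the paper (a dense boolean mask; the repo itself implements these patterns with fused GPU kernels):

```python
import numpy as np

def strided_mask(n, stride):
    # Position i attends to the previous `stride` positions (local)
    # and to every stride-th position before that (summary),
    # causally masked, as in the Sparse Transformers paper.
    i = np.arange(n)[:, None]
    j = np.arange(n)[None, :]
    return (j <= i) & (((i - j) < stride) | ((i - j) % stride == 0))

print(strided_mask(16, 4).astype(int))
```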
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
Efficient GPU kernels for block-sparse matrix multiplication and convolution
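As a point of reference, a plain numpy version of what a block-sparse matmul computes; block size, layout, and names here are arbitrary, and the repo's value is precisely that it fuses this loop into efficient GPU kernels:

```python
import numpy as np

def block_sparse_matmul(x, blocks, layout, bs):
    # W is stored as a list of dense (bs x bs) blocks plus a binary
    # layout mask saying which blocks of the full matrix are nonzero.
    y = np.zeros((x.shape[0], layout.shape[1] * bs))
    k = 0
    for bi in range(layout.shape[0]):
        for bj in range(layout.shape[1]):
            if layout[bi, bj]:
                y[:, bj*bs:(bj+1)*bs] += x[:, bi*bs:(bi+1)*bs] @ blocks[k]
                k += 1
    return y

rng = np.random.default_rng(0)
layout = rng.integers(0, 2, size=(4, 4))
blocks = [rng.normal(size=(32, 32)) for _ in range(int(layout.sum()))]
x = rng.normal(size=(8, 4 * 32))
print(block_sparse_matmul(x, blocks, layout, 32).shape)  # (8, 128)
```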
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
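The core IAF transform is small enough to sketch; `ar_net` below is a hypothetical stand-in for a MADE-style network whose i-th output depends only on z[:i]:

```python
import numpy as np

def iaf_step(z, ar_net):
    # One inverse autoregressive flow step: z' = sigma * z + mu, with
    # (mu, sigma) autoregressive in z, so the Jacobian is triangular
    # and log|det| reduces to sum(log(sigma)).
    mu, sigma = ar_net(z)                  # sigma > 0, e.g. via softplus
    return sigma * z + mu, np.sum(np.log(sigma), axis=-1)

def dummy_ar_net(z):
    # Trivial stand-in (constants depend on nothing, so the
    # autoregressive property holds vacuously).
    return np.zeros_like(z), np.full_like(z, 1.5)

z, log_det = iaf_step(np.linspace(-1, 1, 4), dummy_ar_net)
print(z, log_det)
```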
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
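The title operation is compact enough to sketch in numpy: a learned invertible channel-mixing matrix W applied at every spatial position, contributing h*w*log|det W| to the flow objective (toy shapes; not the repo's TensorFlow code):

```python
import numpy as np

def invertible_1x1_conv(x, W):
    # x: (batch, h, w, c); W: (c, c). The same channel mix is applied
    # at every pixel; the objective gets h*w*log|det W| per sample.
    _, h, w, _ = x.shape
    return x @ W, h * w * np.log(abs(np.linalg.det(W)))

rng = np.random.default_rng(0)
W = np.linalg.qr(rng.normal(size=(8, 8)))[0]     # init as a rotation
x = rng.normal(size=(2, 4, 4, 8))
z, log_det = invertible_1x1_conv(x, W)
print(np.allclose(z @ np.linalg.inv(W), x))      # invertible: True
```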
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Submissions for AI and Efficiency SOTAs
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
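The published Distil-Whisper checkpoints load through the Hugging Face transformers pipeline; a minimal sketch, where the model ID and audio path are examples:

```python
from transformers import pipeline

asr = pipeline("automatic-speech-recognition",
               model="distil-whisper/distil-large-v2")  # one published checkpoint
print(asr("audio.mp3")["text"])  # example local audio file
```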
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
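A minimal usage sketch, assuming the FlaxWhisperPipline entry point the repo documents (check the README for current options such as dtype and batch size; checkpoint and file names here are examples):

```python
import jax.numpy as jnp
from whisper_jax import FlaxWhisperPipline

# Checkpoint and half precision are illustrative choices.
pipeline = FlaxWhisperPipline("openai/whisper-large-v2", dtype=jnp.bfloat16)
outputs = pipeline("audio.mp3")  # example local audio file
print(outputs)
```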
Faster Whisper transcription with CTranslate2
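Typical usage of the faster-whisper package, per its README (model size, device, and file name are illustrative):

```python
from faster_whisper import WhisperModel

model = WhisperModel("large-v2", device="cuda", compute_type="float16")
segments, info = model.transcribe("audio.mp3", beam_size=5)
for seg in segments:
    print("[%.2fs -> %.2fs] %s" % (seg.start, seg.end, seg.text))
```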
Code for the paper "Distribution Augmentation for Generative Modeling" (ICML 2020).
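The idea in one sketch: transform each training example and condition the generative model on which transform was applied, so augmentation enlarges the training signal without shifting the modeled distribution (the transform set and helper below are illustrative, not the repo's code):

```python
import numpy as np

TRANSFORMS = [
    lambda x: x,                       # identity
    lambda x: np.rot90(x, 1, (0, 1)),  # 90-degree rotation
    lambda x: x[::-1],                 # vertical flip
    lambda x: x.transpose(1, 0, 2),    # transposition
]

def dist_augment(batch, rng):
    # Return transformed images plus the transform index, which the
    # generative model receives as a conditioning token.
    ts = rng.integers(0, len(TRANSFORMS), size=len(batch))
    xs = np.stack([TRANSFORMS[t](x) for t, x in zip(ts, batch)])
    return xs, ts

rng = np.random.default_rng(0)
images = rng.normal(size=(8, 32, 32, 3))   # toy square images
xs, ts = dist_augment(images, rng)
print(xs.shape, ts)
```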