MIT License
Repository for the paper "Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them...
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
Faster Whisper transcription with CTranslate2
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an i...
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"
Fine-tune the Whisper speech recognition model to support training without timestamp data, traini...
Code for Implicit Generation and Generalization with Energy Based Models
Release for Improved Denoising Diffusion Probabilistic Models
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Efficient GPU kernels for block-sparse matrix multiplication and convolution
Speech o Text using docker image with ggerganov/whisper.cpp