Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
MIT License
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Implementation of MagViT2 Tokenizer in Pytorch
A concise but complete implementation of CLIP with various experimental improvements from recent ...
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
Implementation of Alphafold 3 in Pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 min...
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-...
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch