PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI
MIT License
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...
Implementation of the Equiformer, SE3/E3 equivariant attention network that reaches new SOTA, and...
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
A concise but complete implementation of CLIP with various experimental improvements from recent ...
Implementation of Imagen, Google's Text-to-Image Neural Network, in PyTorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Implementation of Voicebox, a new SOTA text-to-speech network from Meta AI, in PyTorch
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
Implementation of NaturalSpeech 2, Zero-shot Speech and Singing Synthesizer, in PyTorch
Implementation of AlphaFold 3 in PyTorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
Implementation of Autoregressive Diffusion in PyTorch
Implementation / replication of DALL-E, OpenAI's Text-to-Image Transformer, in PyTorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch