Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch
MIT License
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Genera...
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Implementation of Autoregressive Diffusion in Pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-...
Lumina-T2X is a unified framework for Text to Any Modality Generation
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Audio generation using diffusion models, in PyTorch.
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images an...
Implementation of a framework for Gamengen in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch