Implementation of NWT, audio-to-video generation, in Pytorch
MIT License
Implementation of Block Recurrent Transformer - Pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver ...
Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 ...
Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizin...
Implementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a ...
Implementation of Feedback Transformer in Pytorch
Implementation of Discrete Key / Value Bottleneck, in Pytorch
Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neu...
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Genera...
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch