Implementation of the Remixer Block from the Remixer paper, in Pytorch
MIT License
Implementation of Feedback Transformer in Pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google DeepMind, in Pytorch
Implementation of Block Recurrent Transformer - Pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in Pytorch
Implementation of the MLP-Mixer model from the original paper, "MLP-Mixer: An all-MLP Architecture for Vision"
Implementation of Memformer, a Memory-augmented Transformer, in Pytorch
Implementation of Hierarchical Transformer Memory (HTM) for Pytorch
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ...
PyTorch implementation of "MLP-Mixer: An all-MLP Architecture for Vision" Tolstikhin et al. (2021)
Implementation of Fast Transformer in Pytorch
Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen...
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Py...
An All-MLP solution for Vision, from Google AI
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composi...