Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Python - Released: 03 Jan 2023 - 860
Implementation of RETRO, DeepMind's retrieval-based attention net, in Pytorch
Python - Released: 15 Jan 2022 - 849
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Python - Released: 05 May 2022 - 977
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Python - Released: 20 Mar 2023 - 223
Implementation of ST-MoE, the latest incarnation of MoE after years of research at Brain, in Pytorch
Python - Released: 26 Mar 2023 - 285
Implementation of Autoregressive Diffusion in Pytorch
Python - Released: 23 Jul 2024 - 265
Implementation of a U-net complete with efficient attention as well as the latest research findings
Python - Released: 23 Mar 2022 - 259
A concise but complete implementation of CLIP with various experimental improvements from recent papers
Python - Released: 01 Dec 2021 - 686
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
Python - Released: 04 Apr 2022 - 822
Implementation of Lumiere, SOTA text-to-video generation from Google DeepMind, in Pytorch
Python - Released: 05 Feb 2024 - 243
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Python - Released: 04 May 2024 - 244
Implementation of Block Recurrent Transformer, in Pytorch
Python - Released: 07 Feb 2023 - 212
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Python - Released: 21 Mar 2022 - 623
Experiments around a simple idea for inducing multiple hierarchical predictive models within a GPT
Python - Released: 06 Apr 2023 - 204
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Python - Released: 13 Jul 2020 - 624
Implementation of E(n)-Transformer, which incorporates attention mechanisms into Welling's E(n)-Equivariant Graph Neural Network
Python - Released: 27 Feb 2021 - 208
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
Python - Released: 11 Feb 2021 - 689
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind
Python - Released: 09 Jan 2024 - 167
Implementation of E(n)-Equivariant Graph Neural Networks, in Pytorch
Python - Released: 26 Feb 2021 - 414
Implementation of rectified flow and some of its followup research / improvements in Pytorch
Python - Released: 17 Jul 2024 - 161
Implementation of NÜWA, state-of-the-art attention network for text-to-video synthesis, in Pytorch
Python - Released: 28 Nov 2021 - 540
Implementation of Bottleneck Transformer in Pytorch
Python - Released: 28 Jan 2021 - 670
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
Python - Released: 22 Jun 2022 - 522
Pytorch implementation of SIREN - Implicit Neural Representations with Periodic Activation Functions
Python - Released: 19 Jun 2020 - 465
A simple cross attention that updates both the source and target in one step
Python - Released: 27 Mar 2022 - 145