Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
MIT License
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Implementation of Parti, Google's pure attention-based text-to-image neural network, in PyTorch
Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architectu...
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google DeepMind, in PyTorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in PyTorch
Implementation of MaMMUT, a simple vision-encoder text-decoder architecture for multimodal tasks ...
Implementation of RT1 (Robotic Transformer) in PyTorch
Implementation of a Transformer, but completely in Triton
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
Implementation of the conditionally routed attention in the CoLT5 architecture, in PyTorch
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly bet...
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net o...
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Py...
Implementation of Voicebox, new SOTA text-to-speech network from Meta AI, in PyTorch