Train vision models using JAX and 🤗 transformers
APACHE-2.0 License
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architectu...
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Computational photography pipeline that performs multiple inferences from any image or video.
Implementation of Alphafold 3 in Pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Re...
Unofficial implementation of iTransformer - SOTA Time Series Forecasting using Attention networks...
Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 min...
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
A concise but complete implementation of CLIP with various experimental improvements from recent ...