This repository contains the official implementation of Astroformer, an ICLR Workshop 2023 paper.
APACHE-2.0 License
Dense Prediction Transformers
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with ...
Latte: Latent Diffusion Transformer for Video Generation.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (...
MambaOut: Do We Really Need Mamba for Vision?
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
A Library for Advanced Deep Time Series Models.
This is a collection of our NAS and Vision Transformer work.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
DALL·E Mini - Generate images from a text prompt