This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models
OTHER License
Train high-quality text-to-image diffusion models in a data & compute efficient manner
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybr...
This is an official implementation for "AutoFocusFormer: Image Segmentation off the Grid".
This repository contains the official implementation of the research paper, "MobileCLIP: Fast Ima...