[CVPR 2021] Code for "Augmentation Strategies for Learning with Noisy Labels".
MIT License
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
CNN-based singing voice detection experiments
Train vision models using JAX and 🤗 transformers
Generic image compressor for machine learning. Pytorch code for our paper "Lossy compression for ...
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to...
Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation
Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022
[CVPR2024] DisCo: Referring Human Dance Generation in Real World
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
Semantic Image Synthesis with SPADE
Usable Implementation of "Bootstrap Your Own Latent" self-supervised learning, from Deepmind, in ...
GLM (General Language Model)
Hybrid Discriminative-Generative Training via Contrastive Learning
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
A simple method to perform semi-supervised learning with limited data.