A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
APACHE-2.0 License
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
Kandinsky 2 — multilingual text2image latent diffusion model
This repository contains implementations and illustrative code to accompany DeepMind publications
Taming Transformers for High-Resolution Image Synthesis