Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
MIT License
Statistics for this project are still being loaded, please check back later.
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video ...
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Official repository of the "Fine-grained Key-Value Memory Enhanced Predictor for Video Representa...
FILM: Frame Interpolation for Large Motion, In ECCV 2022.
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Background Matting: The World is Your Green Screen
Large-scale text-video dataset. 10 million captioned short videos.
NVIDIA's Deep Imagination Team's PyTorch Library
Depth-Aware Video Frame Interpolation (CVPR 2019)
Official repo for VGen: a holistic video generation ecosystem for video generation building on di...
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Te...
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposit...