A PyTorch implementation of Factorized Inference in Deep Markov Models for Incomplete Multimodal Time Series (https://arxiv.org/abs/1905.13570).
Statistics for this project are still being loaded, please check back later.
Semantic Image Synthesis with SPADE
Official repo for VGen: a holistic video generation ecosystem for video generation building on di...
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Repository for the paper " 3D Human Mesh Regression with Dense Correspondence "
Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic video-to...
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation
The official repo for CVPR2021——ViPNAS: Efficient Video Pose Estimation via Neural Architecture S...
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...
PyTorch re-implementation of [Structured Inference Networks for Nonlinear State Space Models, AAA...
Unsupervised Language Modeling at scale for robust sentiment classification
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多...
The official PyTorch implementation of the paper "Human Motion Diffusion Model"