A fast gigapixel processing system
APACHE-2.0 License
Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tune...
Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
A python library built to empower developers to build applications and systems with self-contain...
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
Background Matting: The World is Your Green Screen
Boosting Driving Scene Understanding with Advanced Vision-Language Models
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多...
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
A Swift Wrapper for PyTorch and Torchvision.
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Pe...
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)