LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
Apache-2.0 License
Published by Phil26AT over 1 year ago
[CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
A collection of computer vision pre-trained models.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. Near GPT-4o performance, open-source multi...
[CVPR 2024] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This...
High-Resolution Image Synthesis with Latent Diffusion Models
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
Official code for DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
Visual localization made easy with hloc
Official PyTorch implementation of VoxFormer [CVPR 2023 Highlight]
20+ high-performance LLM implementations with recipes to pretrain, finetune and deploy at scale.
[IJCV2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]