Pytorch implementation for "Open Compound Domain Adaptation" (CVPR 2020 ORAL)
BSD-3-CLAUSE License
Bamboo: 4 times larger than ImageNet; 2 time larger than Object365; Built by active learning.
This is the official implemantation of “Learn-to-Decompose: Cascaded Decomposition Network for Cr...
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning fr...
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual R...
Code for ShadeGAN (NeurIPS2021)
The official repo for CVPR2021——ViPNAS: Efficient Video Pose Estimation via Neural Architecture S...
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
[ICCV2019] Robust Multi-Modality Multi-Object Tracking
Group R-CNN for Point-based Weakly Semi-supervised Object Detection (CVPR2022)
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large...
PyTorch codes for "Progressive Semantic-Aware Style Transformation for Blind Face Restoration", C...
Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the co...
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
Code for CVPR 2019 paper "Libra R-CNN: Towards Balanced Learning for Object Detection"
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest