CMR

CDNet

This is the official implementation of “Learn-to-Decompose: Cascaded Decomposition Network for Cr...

16 Jul 2022 13

CrossConST-SR

Code for EMNLP 2023 industry track paper "Learning Multilingual Sentence Representations with Cro...

20 Apr 2023 5

DaSiamRPN

[ECCV2018] Distractor-aware Siamese Networks for Visual Object Tracking

13 Aug 2018 1,248

FeMaSR

PyTorch codes for "Real-World Blind Super-Resolution via Feature Matching with Implicit High-Reso...

05 Jan 2022 161

STT

A multi-task model which does image captioning, sentence paraphrasing and cross-modal retrieval.

25 Dec 2018 18

pysot

SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and ...

07 May 2019 4,421

xmodaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics (e.g., image capt...

25 Jun 2021 1,020

ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

26 Feb 2023 125

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | Based on the CPM foundation...

30 Jun 2023 1,075

Realtime_Multi-Person_Pose_Estimation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

12 Dec 2016 5,084

Union14M

[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective

07 Nov 2022 165

mx-maskrcnn

An MXNet implementation of Mask R-CNN

24 Oct 2017 1,755

InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable...

10 Nov 2022 2,486

multiview_pose

[ICCV2021] Code Release of Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images

02 Dec 2022 14

ViP-Object-Detection

This repository contains the official implementation to reproduce object detection results of ViP.

28 Jul 2021 8

Related Projects