The Places365-CNNs for Scene Classification
MIT License
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal
An MXNet implementation of Mask R-CNN
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Ne...
Semantic Image Synthesis with SPADE
Gluon CV Toolkit
🔥3D点云目标检测&语义分割(深度学习)-SOTA方法,代码,论文,数据集等
Object detection, 3D detection, and pose estimation using center point detection:
Interactive Image Generation via Generative Adversarial Networks
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregress...
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This...
A collection of computer vision pre-trained models.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO