This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
MIT License
Code for CVPR 2019 paper "Libra R-CNN: Towards Balanced Learning for Object Detection"
Atlas: End-to-End 3D Scene Reconstruction from Posed Images
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
The Places365-CNNs for Scene Classification
The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This...
SCAN: Learning to Classify Images without Labels, incl. SimCLR. [ECCV 2020]
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
📈 目前最大的工业缺陷检测数据库及论文集 Constantly summarizing open source dataset and critical papers in the field ...
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (...
The Learnable Typewriter: A Generative Approach to Text Line Analysis
A Simple and Versatile Framework for Object Detection and Instance Recognition
Gluon CV Toolkit
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
[CVPR2021, PAMI2023] End-to-End Object Detection with Learnable Proposal