DocTrPP: DocTr++ in PaddlePaddle

Introduction

This is a PaddlePaddle implementation of DocTr++. The original paper is DocTr++: Deep Unrestricted Document Image Rectification. The original code is here.

Requirements

You need to install the latest version of PaddlePaddle, which is done through this link.

Training

Data Preparation

To prepare datasets, refer to doc3D.

Training

sh train.sh

export OPENCV_IO_ENABLE_OPENEXR=1
export CUDA_VISIBLE_DEVICES=0

python train.py --img-size 288 \
    --name "DocTr++" \
    --batch-size 12 \
    --lr 2.5e-5 \
    --exist-ok \
    --use-vdl

Load Trained Model and Continue Training

export OPENCV_IO_ENABLE_OPENEXR=1
export CUDA_VISIBLE_DEVICES=0

python train.py --img-size 288 \
    --name "DocTr++" \
    --batch-size 12 \
    --lr 2.5e-5 \
    --resume "runs/train/DocTr++/weights/last.ckpt" \
    --exist-ok \
    --use-vdl

Test and Inference

Test the dewarp result on a single image:

python predict.py -i "crop/12_2 copy.png" -m runs/train/DocTr++/weights/best.ckpt -o 12.2.png

Export to onnx

pip install paddle2onnx

python export.py -m ./best.ckpt --format onnx

Model Download

The trained model can be downloaded from here.

Related Projects

Deep-Vectorization-of-Technical-Drawings

Official Pytorch repository for Deep Vectorization of Technical Drawings https://arxiv.org/abs/20...

09 Jun 2020 98

mdef_detr

22 Nov 2021 10

PaddleVideo

Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation too...

12 Nov 2020 1,506

mxnet-yolo

YOLO: You only look once real-time object detector

28 Apr 2017 239

OmniBenchmark

[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning fr...

12 Jul 2022 105

PPYOLOE_pytorch

An unofficial implementation of Pytorch version PP-YOLOE,based on Megvii YOLOX training code.

15 Apr 2022 179

convnet-for-geometric-matching

A Lasagne and Theano implementation of the paper "Convolutional neural network architecture for g...

10 May 2017 29

DAIN

Depth-Aware Video Frame Interpolation (CVPR 2019)

22 Mar 2019 8,196

midv-500-models

Model for document segmentation trained on the midv-500-models dataset.

19 May 2020 72

fatigue-detection

本项目基于PaddleDetection目标检测开发套件，选取1.3M超轻量PPYOLO tiny进行项目开发，并部署于windows端。

30 May 2021 10

torch-em

Deep-learning based semantic and instance segmentation for 3D Electron Microscopy and other bioim...

01 Mar 2021 72

CVPR2021_PLOP

Official code of CVPR 2021's PLOP: Learning without Forgetting for Continual Semantic Segmentation

05 Mar 2021 144

MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

13 Nov 2021 2,591

ssl_for_fgvc

Self-Supervised Learning for Fine-Grained Image Categorization

28 Jan 2021 24

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

06 Jul 2023 365