GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Stars

5.2K

View Code on GitHub

Ecosystems: Python

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

113

Time to Close Issues

2 days

Package Rankings

Top 33.85% on Pypi.org

Badges

Extracted from project README

Related Projects

VisCPM

[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | 基于CPM基础模...

30 Jun 2023 1,075

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities...

29 Jan 2022 2,401

Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large...

09 Nov 2023 1,314

open-instruct

09 Jun 2023 1,214

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

18 Sep 2023 4,242

DB

A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".

18 Nov 2019 2,084

Vary

Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language M...

07 Dec 2023 1,518

F-LMM

Code Release of F-LMM: Grounding Frozen Large Multimodal Models

28 Mar 2024 28

CoDeF

[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Te...

15 Aug 2023 4,828

ITER

PyTorch codes for "Iterative Token Evaluation and Refinement for Real-World Super-Resolution", AA...

10 Dec 2023 47

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

17 Mar 2023 2,666

DiffBIR

Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

28 Aug 2023 3,289

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

18 Sep 2023 5,913

HRNet-Semantic-Segmentation

The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This...

09 Apr 2019 3,130

long_llama

LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...

06 Jul 2023 1,448