nougat

Implementation of Nougat Neural Optical Understanding for Academic Documents

MIT License

Downloads

13.7K

Stars

8.8K

Committers

View Code on GitHub Visit Website

Ecosystems: Python

Bot releases are hidden (Show)

nougat - 0.1.0-small Latest Release

Published by lukas-blecher about 1 year ago

nougat-small weights

nougat - 0.1.0-base

Published by lukas-blecher about 1 year ago

nougat-base weights

Package Rankings

Top 7.08% on Pypi.org

Badges

Extracted from project README

Related Projects

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

18 Sep 2023 5,913

prismer

The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".

02 Mar 2023 1,295

Latte

Latte: Latent Diffusion Transformer for Video Generation.

28 Oct 2023 1,652

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

11 Dec 2020 12,124

Complex-YOLOv4-Pytorch

The PyTorch Implementation based on YOLOv4 of the paper: "Complex-YOLO: Real-time 3D Object Detec...

03 Jul 2020 1,234

ocrd_detectron2

OCR-D wrapper for detectron2 based segmentation models

21 Jan 2022 16

OLMo

Modeling, training, eval, and inference code for OLMo

20 Feb 2023 3,877

ChatLM-mini-Chinese

中文对话0.2B小模型（ChatLM-Chinese-0.2B），开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sf...

27 Aug 2023 1,166

surya

OCR, layout analysis, reading order, line detection in 90+ languages

10 Jan 2024 6,739

AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

18 Sep 2023 4,242

ImageCaptioning.pytorch

I decide to sync up this repo and self-critical.pytorch. (The old master is in old master branch ...

10 Feb 2017 1,419

GLM

GLM (General Language Model)

18 Mar 2021 3,170

Marigold

[CVPR 2024] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

27 Nov 2023 1,590

SPADE

Semantic Image Synthesis with SPADE

14 Mar 2019 7,528

ALAE

[CVPR2020] Adversarial Latent Autoencoders

12 Mar 2019 3,508