mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multi... with performance approaching GPT-4o
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
OpenMMLab Foundational Library for Training Deep Learning Models
Alibaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab
[LLM] Train a small 26M-parameter GPT completely from scratch in 3 hours; inference and training need as little as a 2 GB GPU!
👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + ...
A state-of-the-art open visual language model | multimodal pre-trained model
A Comprehensive Toolkit for High-Quality PDF Content Extraction
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large...
Mixture-of-Experts for Large Vision-Language Models
[ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | based on the CPM base mo...
A collection of CVPR 2024 papers and open-source projects