TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
BSD-3-CLAUSE License
⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language mod...
Code Release of F-LMM: Grounding Frozen Large Multimodal Models
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language M...
GLM (General Language Model)
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2....
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
20+ high-performance LLM implementations with recipes to pretrain, finetune and deploy at scale.
LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA a...
a state-of-the-art-level open visual language model | 多模态预训练模型