GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
OTHER License
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning fr...
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Pytorch implementation for "Large-Scale Long-Tailed Recognition in an Open World" (CVPR 2019 ORAL)
Chain-of-Hindsight, A Scalable RLHF Method
An open platform for training, serving, and evaluating large language models. Release repo for Vi...
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
a state-of-the-art-level open visual language model | 多模态预训练模型
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
The official repo for CVPR2021——ViPNAS: Efficient Video Pose Estimation via Neural Architecture S...
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2....