Set-of-Mark Prompting for GPT-4V and LMMs
MIT License
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Bringing Old Photos Back to Life (CVPR 2020 oral)
All things prompt engineering
General technology for enabling AI capabilities w/ LLMs and MLLMs
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
This repository provides the sample code designed to interpret human demonstration videos and con...
Grounded Language-Image Pre-training
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metapr...
A Multi-Task Dataset for Simulated Humanoid Control
To speed up long-context LLMs' inference, calculate the attention approximately and with dynamic sparsity,...
Large-scale pretraining for dialogue
A unified evaluation framework for large language models
This repository contains resources for accessing the official benchmarks, code, and checkpoints ...