Set-of-Mark Prompting for GPT-4V and LMMs
MIT License
Published by jwyang 12 months ago
SoM + GPT-4V demo
Set-of-Mark Benchmark for evaluating visual grounding using visual prompting techniques.
Grounded Language-Image Pre-training
Bringing Old Photo Back to Life (CVPR 2020 oral)
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
A unified evaluation framework for large language models
Large-scale pretraining for dialogue
A Multi-Task Dataset for Simulated Humanoid Control
This repository contains resources for accessing the official benchmarks, codes, and checkpoints ...
General technology for enabling AI capabilities w/ LLMs and MLLMs
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
This repository provides the sample code designed to interpret human demonstration videos and con...
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metapr...
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...
To speed up long-context LLM inference, compute attention with approximate and dynamic sparsity,...
All things prompt engineering