SoM

Set-of-Mark Prompting for GPT-4V and LMMs

MIT License

Stars: 1.1K

Ecosystems: VS Code Extension, Playwright, TypeScript, Windows, Windows UI Library (WinUI)

SoM - demo (Latest Release)

Published by jwyang 12 months ago

SoM + GPT-4V demo

SoM - SoM-Bench

Published by jwyang 12 months ago

Set-of-Mark Benchmark for evaluating visual grounding using visual prompting techniques.

Package Rankings

Top 6.75% on Proxy.golang.org

Related Projects

GLIP

Grounded Language-Image Pre-training

24 Nov 2021 2,182

Bringing-Old-Photos-Back-to-Life

Bringing Old Photo Back to Life (CVPR 2020 oral)

24 Jun 2020 15,020

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...

28 Nov 2022 1,287

promptbench

A unified evaluation framework for large language models

13 Jun 2023 2,407

DialoGPT

Large-scale pretraining for dialogue

29 Aug 2019 2,353

MoCapAct

A Multi-Task Dataset for Simulated Humanoid Control

07 Jun 2022 158

MathOctopus

This repository contains resources for accessing the official benchmarks, codes, and checkpoints ...

25 Oct 2023 41

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

13 Dec 2022 3,604

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

30 Mar 2023 23,583

GPT4Vision-Robot-Manipulation-Prompts

This repository provides the sample code designed to interpret human demonstration videos and con...

26 Mar 2024 25

sammo

A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metapr...

04 Dec 2023 558

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

27 May 2019 1,116

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...

20 May 2023 1,483

MInference

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention,...

22 May 2024 725

promptbase

All things prompt engineering

12 Dec 2023 5,372