X-Decoder

[CVPR 2023] Official implementation of X-Decoder: Generalized Decoding for Pixel, Image, and Language

Apache-2.0 License

Stars: 1.3K

Issue Statistics

Time to Close Issues: 14 days (past year), 15 days (all time)

Related Projects

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

23 Jul 2019 19,616

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

08 Feb 2022 1,164

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

27 May 2019 1,116

CodeBERT

17 Jun 2020 2,192

fastseq

An efficient implementation of the popular sequence models for text generation, summarization, an...

15 Jul 2020 431

VideoX

VideoX: a collection of video cross-modal models

21 Nov 2019 968

SoM

Set-of-Mark Prompting for GPT-4V and LMMs

16 Oct 2023 1,114

MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g...

16 Aug 2017 5,788

GLIP

Grounded Language-Image Pre-training

24 Nov 2021 2,182

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...

20 May 2023 1,483

torchscale

Foundation Architecture for (M)LLMs

17 Nov 2022 3,006

Bringing-Old-Photos-Back-to-Life

Bringing Old Photo Back to Life (CVPR 2020 oral)

24 Jun 2020 15,000

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

30 Mar 2023 23,575

DialoGPT

Large-scale pretraining for dialogue

29 Aug 2019 2,352

DeBERTa

The implementation of DeBERTa

08 Jun 2020 1,971