X-Decoder

[CVPR 2023] Official implementation of X-Decoder: Generalized Decoding for Pixel, Image, and Language

Apache-2.0 License

Stars: 1.3K

Ecosystems: Playwright, Windows UI Library (WinUI), Windows, TypeScript, VS Code Extension

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues: 14 days (past year), 15 days (all time)

Related Projects

VideoX

VideoX: a collection of video cross-modal models

21 Nov 2019 968

unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

23 Jul 2019 19,644

GLIP

Grounded Language-Image Pre-training

24 Nov 2021 2,182

DeBERTa

The implementation of DeBERTa

08 Jun 2020 1,976

CodeBERT

17 Jun 2020 2,192

torchscale

Foundation Architecture for (M)LLMs

17 Nov 2022 3,006

SoM

Set-of-Mark Prompting for GPT-4V and LMMs

16 Oct 2023 1,114

MathOctopus

This repository contains resources for accessing the official benchmarks, codes, and checkpoints ...

25 Oct 2023 41

MMdnn

MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g...

16 Aug 2017 5,792

JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

30 Mar 2023 23,583

fastseq

An efficient implementation of the popular sequence models for text generation, summarization, an...

15 Jul 2020 431

MASS

MASS: Masked Sequence to Sequence Pre-training for Language Generation

27 May 2019 1,116

SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

08 Feb 2022 1,164

Bringing-Old-Photos-Back-to-Life

Bringing Old Photo Back to Life (CVPR 2020 oral)

24 Jun 2020 15,020

DialoGPT

Large-scale pretraining for dialogue

29 Aug 2019 2,353