VideoX: a collection of video cross-modal models
Set-of-Mark Prompting for GPT-4V and LMMs
Dedicated to building industrial foundation models for universal data intelligence across industr...
Bringing Old Photos Back to Life (CVPR 2020 oral)
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
The implementation of DeBERTa
MASS: Masked Sequence to Sequence Pre-training for Language Generation
An efficient implementation of the popular sequence models for text generation, summarization, an...
Grounded Language-Image Pre-training
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
Large-scale pretraining for dialogue
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A Multi-Task Dataset for Simulated Humanoid Control
This repository contains resources for accessing the official benchmarks, code, and checkpoints ...
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, ...