Bringing Old Photos Back to Life (CVPR 2020 oral)
MIT License
Large-scale pretraining for dialogue
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documen...
Graphormer is a general-purpose deep learning backbone for molecular modeling.
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using S...
To speed up long-context LLM inference, compute the attention approximately using dynamic sparsity,...
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
TensorFlow 2 library implementing Graph Neural Networks
Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable...
MMdnn is a set of tools to help users inter-operate among different deep learning frameworks. E.g...
Grounded Language-Image Pre-training
Foundation Architecture for (M)LLMs
VideoX: a collection of video cross-modal models