A simple library for Fréchet Audio Distance (FAD) calculation
MIT License
AEC Challenge
Foundation Architecture for (M)LLMs
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention,...
ProbTS is a benchmarking toolkit for time series forecasting.
NOTSOFAR-1 Challenge: Distant Diarization and ASR
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) C...
Large-scale pretraining for dialogue
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and la...
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabil...
A Multi-Task Dataset for Simulated Humanoid Control
MASS: Masked Sequence to Sequence Pre-training for Language Generation
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
An efficient implementation of the popular sequence models for text generation, summarization, an...
This repository contains resources for accessing the official benchmarks, codes, and checkpoints ...