A simple library for Fréchet Audio Distance (FAD) calculation
MIT License
Published by hykilpikonna 12 months ago
🎉 First release!
Core Features:
Supported Models: