Audio Captioning datasets for PyTorch.
MIT License
Data manipulation and transformation for audio signal processing, powered by PyTorch
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient dat...
A PyTorch-based Speech Toolkit
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain...
Kaggle | 1st place solution for Freesound Audio Tagging 2019
Unofficial PyTorch implementation of Google AI's VoiceFilter system
PyTorch Dataset for Speech and Music audio
An Open Source text-to-speech system built by inverting Whisper.
🎛️ A collection of diverse regression datasets, featuring PyTorch-like dataset classes that autom...
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL challenge@NeurIPS.
Models, data loaders and abstractions for language processing, powered by PyTorch
Deep Semi-Supervised Learning with Holistic methods for audio classification.
An unofficial PyTorch implementation of the audio LM VALL-E