Data manipulation and transformation for audio signal processing, powered by PyTorch
BSD-2-Clause License
A PyTorch-based Speech Toolkit
Unofficial PyTorch implementation of Google AI's VoiceFilter system
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
A PyTorch implementation of sound classification supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
so-vits-svc fork with realtime support, improved interface and more features.
Kaggle | 1st place solution for Freesound Audio Tagging 2019
Noise suppression using deep filtering
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Datasets, Transforms and Models specific to Computer Vision
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
An unofficial PyTorch implementation of the audio LM VALL-E
An Open Source text-to-speech system built by inverting Whisper.
Models, data loaders and abstractions for language processing, powered by PyTorch
A collection of Audio and Speech pre-trained models.