Audio processing by using pytorch 1D convolution network
MIT License
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
Foundational model for human-like, expressive TTS
An unofficial PyTorch implementation of the audio LM VALL-E
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
PyTorch Dataset for Speech and Music audio
splearn: package for signal processing and machine learning with Python. Contains tutorials on un...
Kaggle | 1st place solution for Freesound Audio Tagging 2019
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Data manipulation and transformation for audio signal processing, powered by PyTorch
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
ReconVAT: a semi-supervised automatic music transcription (AMT) model
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A collection of Audio and Speech pre-trained models.
Fast PyTorch based DSP for audio and 1D signals