SEGAN PyTorch implementation: https://arxiv.org/abs/1703.09452
GPL-3.0 License
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
A PyTorch implementation of sound classification that supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
Neural building blocks for speaker diarization: speech activity detection, speaker change detecti...
CTC end-to-end ASR for the TIMIT and 863 corpora.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Clone a voice in 5 seconds to generate arbitrary speech in real-time
My research code to setup and train PyTorch deep nets.
Semantic Segmentation Architectures Implemented in PyTorch
PyTorch implementation of Tacotron, an end-to-end generative TTS model for speech synthesis.
Data manipulation and transformation for audio signal processing, powered by PyTorch
An implementation of SpecAugment, introduced by Google Brain, in TensorFlow and PyTorch
Kaggle | 1st place solution for Freesound Audio Tagging 2019
PyTorch code for the Kaggle Speech Recognition Challenge
Vocal Remover using Deep Neural Networks