A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
MIT License
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Data manipulation and transformation for audio signal processing, powered by PyTorch
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
End-to-End Speech Processing Toolkit
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Foundational model for human-like, expressive TTS
WaveRNN Vocoder + TTS
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models