Clone a voice in 5 seconds to generate arbitrary speech in real time
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Foundational model for human-like, expressive TTS
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
PyTorch implementation of Tacotron, an end-to-end generative TTS model.
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Unofficial PyTorch implementation of Google AI's VoiceFilter system