Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
MPL-2.0 License
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Foundational model for human-like, expressive TTS
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
An Open Source text-to-speech system built by inverting Whisper.
End-to-End Speech Processing Toolkit
so-vits-svc fork with realtime support, improved interface and more features.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Multilingual Voice Understanding Model
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
WaveRNN Vocoder + TTS