VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
MIT License
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
PyTorch implementation of Tacotron, an end-to-end generative TTS model.
Official implementation of Mockingjay in PyTorch
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
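A PyTorch-based speech synthesis project using VITS. VITS is an end-to-end speech synthesis method that is very simple to use: it needs no complex steps such as text alignment, and supports one-click training and generation, which greatly lowers the barrier to entry.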
Foundational model for human-like, expressive TTS
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Clone a voice in 5 seconds to generate arbitrary speech in real-time
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
End-to-End Speech Processing Toolkit
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine