TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
APACHE-2.0 License
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's...
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system ...
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
VILA - a multi-image visual language model with training, inference and evaluation recipe, deploy...
Core Engine of Singing Voice Conversion & Singing Voice Clone
A generative speech model for daily dialogue.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
DeepFloyd-IF (Imagen Free)
OpenChat: Advancing Open-source Language Models with Imperfect Data
Inference and training library for high-quality TTS models.
A collection of Audio and Speech pre-trained models.
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models