A VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion
APACHE-2.0 License
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual spe...
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
http://www.facegood.cc
PyTorch implementation of convolutional neural network-based text-to-speech synthesis models
One minute of voice data is enough to train a good TTS model (few-shot voice cloning)
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A generative speech model for daily dialogue.
Best-practice TTS based on BERT and VITS, with some Natural Speech features from Microsoft; Support ...
Core Engine of Singing Voice Conversion & Singing Voice Clone
Finetune VITS and MMS using HuggingFace's tools
Inference and training library for high-quality TTS models.
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
AudioLDM: Generate speech, sound effects, music and beyond, with text.