Simple GUI application to help record audio dictated from given text prompts, for use in training speech recognition or speech synthesis models.
AGPL-3.0 License
Speech-to-text based on wav2letter built for transfer learning
Code for searching the www.xeno-canto.org bird sound database, and training a machine learning mo...
PyTorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TT...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual spe...
Command-line tools for speech and intent recognition on Linux
Speech-To-Text Prompter, an extension for stable-diffusion-webui using the Whisper model
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
WhisperPlus: Advancing Speech-to-Text Processing 🚀
Inference and training library for high-quality TTS models.
A command line interface to combine text information from subtitles with voice data in the video....
Simple Python library, distributed via binary wheels with few direct dependencies, for easily usi...
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unoffic...
My lightweight J.A.R.V.I.S desktop experiment