ESPNet TTS with Streamlit GUI
MIT License
Statistics for this project are still being loaded, please check back later.
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TT...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual spe...
A generative speech model for daily dialogue.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
Converts text to speech in realtime
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's...
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single ...
open-source multimodal large language model that can hear, talk while thinking. Featuring real-ti...
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including ...
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system ...