The best Gradio web UI for AI subtitling, translation, and dubbing. Automatic subtitle creation using faster-whisper. Easy one-click installation. Fully portable.
GPL-3.0 License
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
An Open Source text-to-speech system built by inverting Whisper.
so-vits-svc fork with realtime support, improved interface and more features.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A web UI for easy subtitle generation using the Whisper model.
Unofficial PyTorch implementation of Google AI's VoiceFilter system
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
An unofficial PyTorch implementation of the audio LM VALL-E
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
GUI for a Vocal Remover that uses Deep Neural Networks.
PyTorch Dataset for Speech and Music audio
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Hum...
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
VITS-based Voice Conversion focused on simplicity, quality and performance.