Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into different languages.
APACHE-2.0 License
AudioLDM: Generate speech, sound effects, music and beyond, with text.
The python library and service for automatic speech recognition and transcribing in Russian and E...
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
A generative speech model for daily dialogue.
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Split a speech audio into separate sentences for language learners.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
WhisperPlus: Advancing Speech-to-Text Processing 🚀
Open-source subtitle generation for seamless content translation.
Converts text to speech in realtime
A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are mad...