A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
OTHER License
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A collection of Audio and Speech pre-trained models.
TTS with The Massively Multilingual Speech (MMS) project
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's...
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
http://www.facegood.cc
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
WhisperPlus: Advancing Speech-to-Text Processing 🚀
The python library and service for automatic speech recognition and transcribing in Russian and E...
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Genera...
Finetune VITS and MMS using HuggingFace's tools
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...