A Python library and service for automatic speech recognition and transcription in Russian and English
APACHE-2.0 License
How to use OpenAI's Whisper to transcribe and diarize audio files
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
My lightweight J.A.R.V.I.S desktop experiment
Split speech audio into separate sentences for language learners.
A screen translator/OCR translator made using Python and Tesseract; the user interface is mad...
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...
Project that aims to sentenize all the open data of Riksdagen and other sources to create an easi...
Program to benchmark various speech recognition APIs
1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning)
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...