The python library and service for automatic speech recognition and transcribing in Russian and English
APACHE-2.0 License
Published by bond005 over 1 year ago
How to use OpenAIs Whisper to transcribe and diarize audio files
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
Simple sentence mining tool for language learning
Program to benchmark various speech recognition APIs
A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project
A Screen Translator/OCR Translator made by using Python and Tesseract, the user interface are mad...
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
A desktop application that transcribes audio from files, microphone input or YouTube videos with ...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Project that aims to sentenize all the open data of Riksdagen and other sources to create an easi...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
Split a speech audio into separate sentences for language learners.
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...