A pyaudio based audio_common with text to speech for ROS 2
MIT License
Open dubbing is an AI dubbing system which uses machine learning models to automatically translat...
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Multi-lingual large voice generation model, providing inference, training and deployment full-sta...
My lightweight J.A.R.V.I.S desktop experiment
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Ready-to-use Multilingual Text-To-Speech (TTS) package.
An Advanced Telegram Bot to Play Radio & Music in Voice Chat. This is Also The Source Code of Th...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
open-source multimodal large language model that can hear, talk while thinking. Featuring real-ti...
Benchmark popular audio i/o packages
Asynchronous wrapper for AWS Polly API
AudioLDM: Generate speech, sound effects, music and beyond, with text.