EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A Web UI for easy subtitle using whisper model.
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisp...
Play ChatGPT and other LLM with Xiaomi AI Speaker
A local chatbot fine-tuned by bilibili user comments.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Open source short video automatic generation tool
李白 作为唐代杰出诗人,其诗歌作品在中国文学史上具有重要地位。近年来,随着数字技术和人工智能的快速发展,传统文化普及推广的形式也面临着创新与变革。国内外对于李白诗歌的研究虽已相当深入,但在...
an extremely simple tool for separating vocals and background music, completely localized for web...
【大模型】3小时完全从0训练一个仅有26M的小参数GPT,最低仅需2G显卡即可推理训练!
Real time interactive streaming digital human
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in h...
Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT4.0/ Claude/文心一言/讯飞星火/通义千问/ Gemini/...
WhisperPlus: Advancing Speech-to-Text Processing 🚀