适用于 diffsinger 的多功能工具集
MIT License
diffsinger
:
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Common Voice Dataset explorer
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to supp...
Text-to-Audio/Music Generation
Core Engine of Singing Voice Conversion & Singing Voice Clone
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ...
Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Offic...
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in h...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
http://www.facegood.cc
無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXの音声合成エンジン