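End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to real-world practice, with very simple introductory examples and practical enterprise projects. Supports the currently most popular models: DeepSpeech2, Conformer, and Squeezeformer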
APACHE-2.0 License
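Speech emotion recognition
Automatic Speech Recognition (ASR), Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy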
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Audio classification implemented with PaddlePaddle, supporting EcapaTdnn, PANNS, TDNN, Res2Net, ResNetSE, and other models, along with multiple preprocessing methods
Speech-to-Text-WaveNet: End-to-end sentence-level English speech recognition based on DeepMind's...
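A PaddlePaddle-based speech synthesis project that uses VITS, a speech synthesis method. This end-to-end model is very simple to use: it needs no complicated steps such as text alignment and supports one-click training and generation, which greatly lowers...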
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation too...
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ASR client for Triton ASR Service
Core Engine of Singing Voice Conversion & Singing Voice Clone
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ...
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
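This project uses several advanced speaker recognition (voiceprint) models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports MelSpectrogram, Spectrogram, MFCC, Fban...
Speech emotion recognition implemented with PyTorch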
A Deep-Learning-Based Chinese Speech Recognition System