Wav2Vec 2.0 catalan training scripts and models
基于Pytorch实现的语音情感识别
Inference and training library for high-quality TTS models.
A generative speech model for daily dialogue.
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fban...
Audio super resolution using neural networks
语音感情识别
Finetune VITS and MMS using HuggingFace's tools
基于PaddlePaddle实现的音频分类,支持EcapaTdnn、PANNS、TDNN、Res2Net、ResNetSE等各种模型,还有多种预处理方法
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's...
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in h...
KhanomTan TTS (ขนมตาล) is an open-source Thai text-to-speech model that supports multilingual spe...
Deepspeech ASR Model for the Catalan Language