Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家
APACHE-2.0 License
Statistics for this project are still being loaded, please check back later.
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Fine-tune the Whisper speech recognition model to support training without timestamp data, traini...
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
📜 A python library for distributed training of a Transformer neural network across the Internet ...
Kaggle | 1st place solution for Freesound Audio Tagging 2019
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
Foundational model for human-like, expressive TTS
End-to-End Speech Processing Toolkit
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models