CTC end -to-end ASR for timit and 863 corpus.
Statistics for this project are still being loaded, please check back later.
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition syst...
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Hum...
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
SincNet is a neural architecture for efficiently processing raw audio samples.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, PO...
End-to-End Speech Processing Toolkit
Foundational model for human-like, expressive TTS
A PyTorch implementation of the Transformer model in "Attention is All You Need".
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models