CTC_pytorch | PyTorch Ecosystem Directory

Statistics for this project are still being loaded, please check back later.

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition syst...

Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Hum...

Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

SincNet is a neural architecture for efficiently processing raw audio samples.

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...

NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, PO...

End-to-End Speech Processing Toolkit

Foundational model for human-like, expressive TTS

A PyTorch implementation of the Transformer model in "Attention is All You Need".

A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models