PyTorch models for lipreading words and sentences
MIT License
Statistics for this project are still being loaded, please check back later.
The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...
Models, data loaders and abstractions for language processing, powered by PyTorch
Unofficial PyTorch implementation of Google AI's VoiceFilter system
Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Proc...
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
A PyTorch implementation of Location-Relative Attention Mechanisms For Robust Long-Form Speech Sy...
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...
A collection of Audio and Speech pre-trained models.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Hum...
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech