MASR

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

APACHE-2.0 License

Downloads

280

Stars

589

Committers

View Code on GitHub

Ecosystems: PyTorch

Commit Statistics

Past Year

All Time

Total Commits

198

Total Committers

Avg. Commits Per Committer

66.0

Bot Commits

Issue Statistics

Past Year

All Time

Total Pull Requests

Merged Pull Requests

Total Issues

Time to Close Issues

about 1 month

Package Rankings

Top 14.86% on Pypi.org

Related Projects

audio-pretrained-model

A collection of Audio and Speech pre-trained models.

18 Jul 2020 180

voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

22 Mar 2019 1,035

EmotiVoice

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

08 Nov 2023 7,282

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

14 Feb 2023 3,811

deepvoice3_pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

31 Oct 2017 1,963

AudioClassification-Pytorch

The Pytorch implementation of sound classification supports EcapaTdnn, PANNS, TDNN, Res2Net, ResN...

20 Aug 2021 378

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE...

28 Jun 2021 752

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, traini...

22 Apr 2023 501

VITS-Pytorch

本项目是基于Pytorch的语音合成项目，使用的是VITS，VITS是一种语音合成方法，这种时端到端的模型使用起来非常简单，不需要文本对齐等太复杂的流程，直接一键训练和生成，大大降低了学习门槛。

23 Aug 2023 16

ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

29 Oct 2019 1,543

espnet

End-to-End Speech Processing Toolkit

13 Dec 2017 7,825