wav2vec2_stt_python

A simple Python library for speech recognition with wav2vec 2.0 models, distributed as binary wheels with few direct dependencies.

AGPL-3.0 License

Ecosystems: Python, PyTorch

Related Projects

py-silero-vad-lite

Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package depende...

24 Sep 2024 5

AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

29 Jan 2023 2,400

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 ...

20 Oct 2022 3,213

tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper

12 Sep 2016 5,408

TensorFlowTTS

TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including ...

22 Mar 2020 3,810

dswav

23 Nov 2023 15

ChatTTS

A generative speech model for daily dialogue.

27 May 2024 31,328

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-sta...

03 Jul 2024 5,597

sd-wav2lip-uhq

Wav2Lip UHQ extension for Automatic1111

03 Aug 2023 1,254

audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

05 May 2017 2,468

Wav2Vec-Wrapper

An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.

16 Apr 2021 81

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

14 Jan 2024 33,328

vocal-separate

An extremely simple tool for separating vocals and background music, completely localized for web...

26 Dec 2023 1,267

TTS

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

23 Jan 2018 8,880

audio-pretrained-model

A collection of Audio and Speech pre-trained models.

18 Jul 2020 180