whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

AGPL-3.0 License

Downloads: 33.7K

Stars: 1.9K

Ecosystems: Python, PyTorch, Whisper

Package Rankings

Top 6.7% on Proxy.golang.org

Top 4.09% on pypi.org

Related Projects

wavenet_vocoder

WaveNet vocoder

27 Dec 2017 2,314

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Suppo...

24 Nov 2022 5,614

Mockingjay-Speech-Representation

Official Implementation of Mockingjay in Pytorch

08 Jun 2020 52

End-to-end-ASR-Pytorch-DLHLP

Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Hum...

10 Sep 2020 13

metavoice-src

Foundational model for human-like, expressive TTS

06 Feb 2024 3,112

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

29 Aug 2017 29,423

TTS

Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

23 Jan 2018 8,880

speechbrain

A PyTorch-based Speech Toolkit

28 Apr 2020 7,821

Whisper-WebUI

A Web UI for easy subtitle using whisper model.

02 Mar 2023 1,083

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

23 Nov 2020 4,206

pytorch-widedeep

A flexible package for multimodal-deep-learning to combine tabular data with text and images usin...

21 Oct 2017 1,243

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training w...

14 Jun 2023 4,785

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

27 Mar 2023 8,170

vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

11 Jan 2023 2,939

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

14 Feb 2023 3,811