Whisper Ecosystem

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

Created by

OpenAI

Released

August 2021

Community Repos

1,021

Total GitHub Stars

455,261

Keywords

openai 115 speech-to-text 88 speech-recognition 77 ai 64 python 49 transcription 42 asr 39 chatgpt 39 paper 30 llm 29

Languages

Python 244 TypeScript 43 Jupyter Notebook 38 JavaScript 25 Go 13 C++ 12 Rust 9 C 8 C# 7 HTML 6

Licenses

MIT 239 APACHE-2.0 43 GPL-3.0 21 OTHER 21 AGPL-3.0 11 GPL-2.0 5 BSD-3-CLAUSE 2 CC0-1.0 2 MPL-2.0 2 UNLICENSE 1

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

09 Dec 2022 8,782

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

25 Jan 2023 3,362

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting

14 Nov 2017 10,329

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

13 Jan 2023 1,898

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

17 Nov 2020 4,083

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

16 Dec 2023 284

faster-whisper-GUI

faster_whisper GUI with PySide6

18 Jul 2023 1,448

whisper.unity

Running speech to text model (whisper

26 Mar 2023 397

nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models

16 Aug 2024 1,871

whisper.api

This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model

12 Aug 2023 863

whisper-standalone-win

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python

25 Feb 2023 727

pyannote-whisper

09 Nov 2022 496

tafrigh

تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit

20 Mar 2023 101

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data

22 Apr 2023 501

AudioNotes

快速提取音视频内容，整理成一份结构化的markdown笔记

19 Jul 2024 1,018

docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

26 Aug 2023 124

whisper_normalizer

A python package for whisper normalizer

21 Mar 2023 31

Indic-Subtitler

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages

05 Feb 2024 55

pyannote-rs

pyannote audio diarization in rust

31 Jul 2024 15

docker-whisper-server

whisper

20 Jul 2024 8