Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It uses an encoder-decoder transformer architecture with an autoregressive text decoder, trained on 680,000 hours of multilingual and multitask supervised audio data, and can transcribe speech in many languages as well as translate it into English. Whisper has shown strong performance across a range of speech benchmarks, and OpenAI has released the model weights and code as open source to encourage research in robust speech processing. It has enabled a wide ecosystem of transcription, subtitling, and voice-driven applications, including the projects below.
A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console
This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Chinese (Mandarin) and Taiwanese Hokkien languages
Open source subtitling platform 💻 for transcribing and translating video/audio files in Indic languages
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
This project provides an API with user-level access support to transcribe speech to text using a fine-tuned and processed Whisper ASR model
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data
A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper
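Several of the projects above generate caption files from Whisper output. As a minimal sketch of how that works, the snippet below converts transcription segments (dicts with `start`, `end`, and `text` keys, mirroring the segment format produced by `transcribe()` in openai/whisper) into SRT subtitle text. The helper names are illustrative and not taken from any of the listed repositories.

```python
# Sketch: render Whisper-style transcription segments as SRT subtitles.
# Assumes segments shaped like {"start": float, "end": float, "text": str},
# as emitted by openai/whisper's transcribe().

def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    ms = int(round(seconds * 1000))
    h, rem = divmod(ms, 3_600_000)
    m, rem = divmod(rem, 60_000)
    s, ms = divmod(rem, 1000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def segments_to_srt(segments) -> str:
    """Render a list of segments as numbered SRT subtitle blocks."""
    blocks = []
    for i, seg in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n{srt_timestamp(seg['start'])} --> "
            f"{srt_timestamp(seg['end'])}\n{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)

if __name__ == "__main__":
    demo = [
        {"start": 0.0, "end": 2.5, "text": " Hello there."},
        {"start": 2.5, "end": 5.0, "text": " This is a subtitle demo."},
    ]
    print(segments_to_srt(demo))
```

The same segment list can be reformatted for other caption formats (e.g. WebVTT uses `.` instead of `,` in timestamps); only the timestamp and block layout change.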