Whisper Ecosystem

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

Created by

OpenAI

Released

August 2021

Community Repos

1,021

Total GitHub Stars

455,261

Core Projects

whisper

64,924

Robust Speech Recognition via Large-Scale Weak Supervision

openai-cookbook

57,985

Examples and guides for using the OpenAI API

gym

34,605

A toolkit for developing and comparing reinforcement learning algorithms

Popular Projects

faster-whisper

Faster Whisper transcription with CTranslate2

11 Feb 2023 9,301

whisper.cpp

Port of OpenAI's Whisper model in C/C++

25 Sep 2022 34,855

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

09 Dec 2022 8,782

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

25 Jan 2023 3,362

distil-whisper

Distilled variant of Whisper for speech recognition

31 Oct 2023 3,520

WhisperLive

A nearly-live implementation of OpenAI's Whisper

04 May 2023 1,194

WhisperKit

Swift native on-device speech recognition with Whisper for Apple Silicon

26 Jan 2024 2,234

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting

14 Nov 2017 10,329

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

13 Jan 2023 1,898

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

17 Nov 2020 4,083

RTranslator

Open source real-time translation app for Android that runs locally

30 Mar 2020 6,537

Whisper-WebUI

A Web UI for easy subtitle using whisper model

02 Mar 2023 1,083

ruby-openai

OpenAI API + Ruby! 🤖❤️ NEW: Assistant Vector Stores

03 Aug 2020 2,725

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2

17 Mar 2023 872

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc

24 Nov 2022 5,614

whisper_mic

Project that allows one to use a microphone with OpenAI whisper

23 Sep 2022 704

buzz

Buzz transcribes and translates audio offline on your personal computer

24 Sep 2022 12,075

use-whisper

React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

06 Mar 2023 718

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code

14 Jun 2023 5,025

auto-subtitle

Automatically generate and overlay subtitles for any video

28 Sep 2022 1,492

More Popular

Up and Coming Projects

WhisperTranscriber

Whisper Transcribe and srt Resegment

14 Oct 2024 2

educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

17 Sep 2024 0

easy-subber

A Python-based tool that that takes video files and generates

06 Sep 2024 5

SpeechToText

Speech-to-Text using OpenAI's Whisper model

04 Sep 2024 0

Transcribe-Translate

Local web app for transcription and translation services for audio and video using Whisper models

28 Aug 2024 1

whisper-client

Very simple Python based client for Whisper compatible endpoint

26 Aug 2024 1

TechStormHack-1st-place

Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС

23 Aug 2024 0

amazon-ivs-webgpu-captions-demo

This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers

21 Aug 2024 1