Whisper Ecosystem

Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.

Created by
OpenAI
Released
August 2021
Community Repos
1,021
Total GitHub Stars
455,261
Core Projects
More
whisper
64,924
Robust Speech Recognition via Large-Scale Weak Supervision
Examples and guides for using the OpenAI API
gym
34,605
A toolkit for developing and comparing reinforcement learning algorithms
Popular Projects 
More

faster-whisper

Faster Whisper transcription with CTranslate2

11 Feb 2023 9,301

whisper.cpp

Port of OpenAI's Whisper model in C/C++

25 Sep 2022 34,855

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

09 Dec 2022 8,782

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

25 Jan 2023 3,362

distil-whisper

Distilled variant of Whisper for speech recognition

31 Oct 2023 3,520

WhisperLive

A nearly-live implementation of OpenAI's Whisper

04 May 2023 1,194

WhisperKit

Swift native on-device speech recognition with Whisper for Apple Silicon

26 Jan 2024 2,234

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting

14 Nov 2017 10,329

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

13 Jan 2023 1,898

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

17 Nov 2020 4,083

RTranslator

Open source real-time translation app for Android that runs locally

30 Mar 2020 6,537

Whisper-WebUI

A Web UI for easy subtitle using whisper model

02 Mar 2023 1,083

ruby-openai

OpenAI API + Ruby! 🤖❤️ NEW: Assistant Vector Stores

03 Aug 2020 2,725

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2

17 Mar 2023 872

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc

24 Nov 2022 5,614

whisper_mic

Project that allows one to use a microphone with OpenAI whisper

23 Sep 2022 704

buzz

Buzz transcribes and translates audio offline on your personal computer

24 Sep 2022 12,075

use-whisper

React hook for OpenAI Whisper with speech recorder, real-time transcription, and silence removal built-in

06 Mar 2023 718

inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code

14 Jun 2023 5,025

auto-subtitle

Automatically generate and overlay subtitles for any video

28 Sep 2022 1,492
Up and Coming Projects 
More

WhisperTranscriber

Whisper Transcribe and srt Resegment

14 Oct 2024 2

educa24-speech-to-summary

Demonstrator for an open-source speech-to-summary workflow

17 Sep 2024 0

easy-subber

A Python-based tool that that takes video files and generates

06 Sep 2024 5

SpeechToText

Speech-to-Text using OpenAI's Whisper model

04 Sep 2024 0

Transcribe-Translate

Local web app for transcription and translation services for audio and video using Whisper models

28 Aug 2024 1

whisper-client

Very simple Python based client for Whisper compatible endpoint

26 Aug 2024 1

TechStormHack-1st-place

Решение соревнования ТехШторм от корпорации ТатНефть по анализу активности членов команды на ВКС

23 Aug 2024 0

amazon-ivs-webgpu-captions-demo

This repository contains an experimental demo application that shows how you can add client-side auto-generated captions to Amazon IVS Real-time and Low-latency streams using transformers

21 Aug 2024 1

nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models

16 Aug 2024 1,871

yuisub

Auto translation of new anime episodes based on Yui-MHCP001

16 Aug 2024 2

transcription_service

System/service with REST API for extracting text transcriptions from movies and audio recordings in most popular video formats

16 Aug 2024 2

Transcription

This project transcribes audio using whisper and provides an api

15 Aug 2024 0

audio-summarize

An audio summarizer (faster-whisper and BART glued together)

13 Aug 2024 1

AI_language_school_based_on_Django_and_OpenAI

Django and OpenAI API example use case

07 Aug 2024 1

voice-writing-electron

A real-time, instant dictation desktop application built on Electron that uses Whisper and GROQ under the hood

05 Aug 2024 20

call-listener_bot

A bot that downloads, transcribes and analyzes calls to find insights for sales advisors

04 Aug 2024 3

stream-translator-gpt-webui

A web ui application that utilizes the stream-translator-gpt

02 Aug 2024 2

subtitler

Creating subtitles from video

01 Aug 2024 0

pyannote-rs

pyannote audio diarization in rust

31 Jul 2024 15