Whisper Ecosystem

Whisper is an automatic speech recognition (ASR) model developed by OpenAI. It is an encoder-decoder Transformer trained on roughly 680,000 hours of multilingual, multitask audio collected from the web. Whisper can be used for tasks such as multilingual speech transcription, speech translation into English, and spoken language identification. OpenAI released the code and model weights as open source under the MIT license to encourage research on robust speech processing, so the model is publicly available and has become the basis for a wide range of speech-based applications.

Created by
OpenAI
Released
September 2022
Community Repos
1,021
Total GitHub Stars
455,261

nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models

16 Aug 2024 1,871

pyannote-rs

pyannote audio diarization in Rust

31 Jul 2024 15

voice-pro

The best Gradio web UI for AI transcription, translation, and TTS

29 Jul 2024 3

docker-whisper-server

whisper

20 Jul 2024 8

AudioNotes

Quickly extract audio and video content and organize it into structured Markdown notes

19 Jul 2024 1,018

Realtime-Whisper-Console-Transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console

08 Jul 2024 3

ChineseTaiwaneseWhisper

This repository focuses on leveraging OpenAI's Whisper model for speech recognition in Mandarin Chinese and Taiwanese Hokkien

01 Jul 2024 3

voice-gulliver

The best Gradio web UI for AI subtitling, translation, and dubbing

05 Jun 2024 1

Indic-Subtitler

Open-source subtitling platform 💻 for transcribing and translating video and audio in Indic languages

05 Feb 2024 55

WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

16 Dec 2023 284

CapGen

A fast CPU-first video/audio transcriber for generating caption files with Whisper and CTranslate2, hosted on Hugging Face Spaces

16 Sep 2023 1

docker-whisperX

Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)

26 Aug 2023 124

whisper.api

This project provides an API with user-level access support for transcribing speech to text using a fine-tuned and processed Whisper ASR model

12 Aug 2023 863

faster-whisper-GUI

faster_whisper GUI with PySide6

18 Jul 2023 1,448

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data

22 Apr 2023 501

whisper.unity

Running speech to text model (whisper

26 Mar 2023 397

whisper_normalizer

A Python package for the Whisper normalizer

21 Mar 2023 31

tafrigh

Transcribe text and generate SRT and VTT files using Whisper models and wit technology

20 Mar 2023 101

shisper

A quick & dirty script to generate and view subtitles and transcriptions for your multimedia files using ggerganov/whisper

10 Mar 2023 8

malayalam_asr_benchmarking

A study benchmarking Whisper-based ASR models in Malayalam

04 Mar 2023 8