Open Source Ecosystems

Abstract

Transcription for Apple Silicon.

Segmentation is performed to divide the sound source into small chunks, a sound source is created by removing silent parts for each chunk, and text is extracted.

Install

$ git clone https://github.com/mbotsu/mlx_speech2text.git
$ pip install -r requirements.txt

Run

// convert to wav 16K
$ ffmpeg -i input.mp4 -ar 16000 out.wav

// run
$ python speech2text.py -i out.wav -o track -v

References

Related Projects

whisper-playground

Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/

02 Oct 2022 776

SpeechToText

Speech-to-Text using OpenAI's Whisper model

04 Sep 2024 0

speech-to-text

Real-time transcription using faster-whisper

30 Mar 2023 375

whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

25 Jan 2023 3,362

whisper-ctranslate2

Whisper command line client compatible with original OpenAI client based on CTranslate2.

17 Mar 2023 872

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

16 Sep 2022 64,924

Whisperboard

The open-source iOS app that's making quality voice transcription more accessible on mobile devices.

23 Dec 2022 615

WhisperLive

A nearly-live implementation of OpenAI's Whisper.

04 May 2023 1,194

VoiceCipher

Local Speech transcription

31 Jul 2024 2

Realtime-Whisper-Console-Transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease ...

08 Jul 2024 3

wscribe

ez audio transcription tool with flexible processing and post-processing options

21 Jul 2023 125

speech-to-text-osc

Speech to text with OSC output.

22 Oct 2023 4

mic2transcript

CLI tool that continuously transcribes audio from the device's built-in microphone to a text file...

21 Jun 2024 2

whisper-node

Node.js bindings for OpenAI's Whisper. (C++ CPU version by ggerganov)

18 Dec 2022 225

audio-summarize

An audio summarizer (faster-whisper and BART glued together)

13 Aug 2024 1