Whisper Console Transcriber

A real-time speech-to-text transcriber using the Whisper model, designed for efficiency and ease of use in the console. This tool leverages the faster_whisper library and Rich to provide a seamless user experience for transcribing audio inputs on the fly.

Background

Whisper is a state-of-the-art model for automatic speech recognition (ASR). This project utilizes the Whisper model and provides a practical interface for capturing live audio input, transcribing it, and displaying the results in real time. It's designed to be flexible, allowing the user to choose the language of transcription and offering a buffer system to handle continuous speech.

Features

Real-time speech-to-text using Whisper model
Support for multiple languages
Console-based application with rich text formatting
Automatic ambient noise adjustment
Saves transcriptions to a file in the Downloads folder

Installation

To install and run this project, follow these steps:

Clone the repo:

git clone https://github.com/nexuslux/Realtime-Whisper-Console-Transcriber
cd WhisperConsoleTranscriber

Set up a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`

Install required dependencies:

pip install faster_whisper speechrecognition rich

How to Run

Run the script:
```
python script_name.py
```
Follow the prompts:
- After running the script, you will be prompted to enter the language code (e.g., 'en' for English, 'zh' for Chinese, 'es' for Spanish).
- The application will then adjust for ambient noise and start capturing audio.
Start speaking or playing audio:
- Once you start speaking, the application will transcribe your speech in real time.
- Transcriptions are buffered and displayed in chunks.
Stop listening:
- Press CTRL + C to stop the transcription process.
- The transcriptions will automatically be saved to a text file in your Downloads folder.

Example

python transcribe.py

After this you will be asked to enter the main language. • Enter the language code: en • Start speaking. The application will display transcribed text in the console. • End the session with CTRL + C. The output will be saved to a text file in the Downloads folder. Customization You can customize the following parameters in the script: • buffer_size: Number of segments to buffer before displaying the transcription. • language_code: Set your preferred default language code for transcription.

Related Projects

wubbl0rz-archiv-transcribe

Speech to text using whisper, used in....

26 Nov 2022 5

whisper_autosrt

A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using faster_whisper module ...

27 May 2023 20

Whisper-WebUI

A Web UI for easy subtitle using whisper model.

02 Mar 2023 1,083

whisper-client

Very simple Python based client for Whisper compatible endpoint

26 Aug 2024 1

LiveWhisper

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper in...

06 Oct 2022 323

whisper-dictate

Run once. Hold left Opt+Cmd and speak. It will transcribe and type what you said, so you don't ha...

06 Nov 2023 2

Whisper-Transcription-UI

Whisper Transcription UI is a user-friendly graphical interface for whisper-standalone-win. Trans...

01 Jun 2024 4