Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
A low-footprint GPU accelerated Speech to Text Python package for the Jetpack 5 era bolstered by an optimized graph
ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3
Note-taking app for online/local video/audio using Whisper transcription, ChatGPT, and Notion
A wayland overlay providing speech-to-text functionality for any application via a global push-to-talk hotkey
🎞️ Automatically generating subtitles for video files using Whisper ASR model in Python
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
Simply forward a video or voice message in any language to the bot, and it will reply with a translation
A GitHub Action for checking the status of a workflow on the main branch as a status on the PR