Real Time Whisper Transcription

This is a demo of real-time speech-to-text with OpenAI's Whisper model. It works by continuously recording audio in a background thread and concatenating the raw bytes across successive recordings.
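The recording approach described above can be sketched roughly as follows. This is a minimal illustration and not the code from the demo scripts: the names `record_chunks`, `drain`, and `fake_microphone` are hypothetical, and a short list of byte strings stands in for a real microphone so the snippet runs without audio hardware or a Whisper model.

```python
import queue
import threading


def record_chunks(source, chunk_queue, stop_event):
    """Producer: push raw audio chunks onto the queue until stopped or exhausted."""
    for chunk in source:
        if stop_event.is_set():
            break
        chunk_queue.put(chunk)


def drain(chunk_queue):
    """Consumer: return everything currently queued, joined into one bytes object."""
    parts = []
    while True:
        try:
            parts.append(chunk_queue.get_nowait())
        except queue.Empty:
            break
    return b"".join(parts)


# Hypothetical stand-in for a microphone: three chunks of four 16-bit samples each.
fake_microphone = [b"\x00\x00" * 4, b"\x01\x00" * 4, b"\x02\x00" * 4]

chunk_queue = queue.Queue()
stop_event = threading.Event()
recorder = threading.Thread(
    target=record_chunks,
    args=(fake_microphone, chunk_queue, stop_event),
    daemon=True,
)
recorder.start()
recorder.join()  # a live recorder would keep running; here the fake source simply ends

# Concatenate the raw bytes from all recordings received so far.
phrase_audio = b""
phrase_audio += drain(chunk_queue)
print(len(phrase_audio))  # 24 bytes: 3 chunks of 8 bytes each
```

In a live setup the main loop would call `drain` repeatedly while the recorder thread keeps running, and the accumulated bytes would be converted to the format Whisper expects before each transcription pass.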

To install dependencies simply run

pip install -r requirements.txt

in an environment of your choosing.

Whisper also requires the command-line tool ffmpeg to be installed on your system, which is available from most package managers:

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on macOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg
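After installing, a quick way to confirm that ffmpeg is visible from Python is to look it up on the PATH. The helper name `ffmpeg_available` below is hypothetical and is not part of this repository or of Whisper; it is only a small convenience check.

```python
import shutil


def ffmpeg_available(executable="ffmpeg"):
    """Return True if the given executable can be found on the PATH."""
    return shutil.which(executable) is not None


if ffmpeg_available():
    print("ffmpeg found at", shutil.which("ffmpeg"))
else:
    print("ffmpeg not found on PATH; install it with one of the commands above")
```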

For more information on Whisper, see https://github.com/openai/whisper

The code in this repository is public domain.
