ROS 2 inference for whisper.cpp.
- Install `pyaudio`, see install instructions (a typical Ubuntu setup is sketched after the build command below).
- Build this repository:
```shell
mkdir -p whisper_ws/src && cd whisper_ws/src && \
git clone https://github.com/ros-ai/ros2_whisper.git && cd .. && \
colcon build --symlink-install --cmake-args -DWHISPER_CUBLAS=On
```
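For the `pyaudio` dependency above, a typical Debian/Ubuntu setup might look like the following (the package names are an assumption for apt-based systems; follow pyaudio's own install instructions on other platforms):

```shell
# Install the PortAudio development headers that pyaudio builds against (Debian/Ubuntu package name)
sudo apt install portaudio19-dev
# Install the Python bindings
pip install pyaudio
```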
Run the inference action server (this will download models to `$HOME/.cache/whisper.cpp`):
```shell
ros2 launch whisper_bringup bringup.launch.py n_thread:=4
```
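On first launch the model download can take a while; one way to confirm the files landed in the cache:

```shell
# Models are stored under the whisper.cpp cache directory
ls -lh $HOME/.cache/whisper.cpp
```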
Run a client node (activated on space bar press):
```shell
ros2 run whisper_demos whisper_on_key
```
The action server is available under the topic `inference` of type `Inference.action`.
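To inspect the action interface from the command line before writing a custom client, the standard `ros2 action` tools can be used (the fully qualified action name shown here, `/inference`, is an assumption; it may be namespaced, so take the name and type reported by `ros2 action list -t`):

```shell
# List available action servers together with their interface types
ros2 action list -t
# Show the clients and servers connected to the inference action (adjust the name to the one listed above)
ros2 action info /inference
```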
- Encoder inference time: ggerganov/whisper.cpp#10 (comment)
- Compile with GPU support (might differ between platforms): https://github.com/ggerganov/whisper.cpp#nvidia-gpu-support-via-cublas, i.e. `WHISPER_CUBLAS=On`.