Skip to content

Speech recognition plugin for GStreamer based on Mozilla's DeepSpeech model

License

Notifications You must be signed in to change notification settings

bdurepo1/gst-deepspeech

 
 

Repository files navigation

GStreamer DeepSpeech Plugin

DeepSpeech is a speech recognition project created by Mozilla.

This project provides a GStreamer element which can be placed into an audio pipeline, it will then report any recognised speech via bus messages. It automatically segments audio based on configurable silence thresholds making it suitable for continuous dictation.

Here’s a couple of example pipelines using gst-launch.

To perform speech recognition on a file, printing all bus messages to the terminal:

gst-launch-1.0 -m filesrc location=/path/to/file.ogg ! decodebin ! audioconvert ! audiorate ! audioresample ! deepspeech ! fakesink

To perform speech recognition on audio recorded from the default system microphone, with changes to the silence thresholds:

gst-launch-1.0 -m pulsesrc ! audioconvert ! audiorate ! audioresample ! deepspeech silence-threshold=0.3 silence-length=20 ! fakesink

About

Speech recognition plugin for GStreamer based on Mozilla's DeepSpeech model

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • C++ 84.4%
  • M4 12.1%
  • Makefile 1.8%
  • Shell 1.7%