DeepSpeech Server

DeepSpeech SR server using Falcon.

Usage

Install the dependencies
pip install -r requirements.txt

If you want GPU acceleration
pip install deepspeech-gpu --upgrade

Download the pre-trained model
wget -O - https://github.com/mozilla/DeepSpeech/releases/download/v0.1.1/deepspeech-0.1.1-models.tar.gz | tar xvfz -

Update path to model files in the params.json file.

Run the server
python main.py --params params.json

SpeechRecognition: POST /api/reco

Params

Parameters for the service are stored in a JSON file. The follow parameters control the beam search decoder.

beam_width Used in the CTC decoder when building candidate transcription.
lm_weight Hyperparameter for the CTC decoder. Language model weight.
word_count_weight Hyperparameter of the CTC decoder. Word insertion weight (penalty).
valid_word_count_weight Valid word insertion weight. This is used to lessen the word insertion penalty.
n_features Number of MFCC features to use.
n_context Size of the context window used for producing timesteps in the input vector.

Examples

http -f POST http://localhost:8080/api/reco < audio_file.wav

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
main.py		main.py
params.json		params.json
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DeepSpeech Server

Usage

Params

Examples

About

Releases

Packages

Languages

stevemurr/deepspeech-server

Folders and files

Latest commit

History

Repository files navigation

DeepSpeech Server

Usage

Params

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages