This directory contains samples for the Google Cloud Speech API. The API lets developers integrate Google speech recognition technologies into their applications: send audio to the Cloud Speech API service and receive a text transcription.
- See the migration guide for information about migrating to Python client library v0.27.
These samples require you to have authentication set up. Refer to the Authentication Getting Started Guide for instructions on setting up credentials for applications.
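As a quick sanity check before running any sample, the sketch below tests whether a key file can be found. It assumes credentials are supplied through the GOOGLE_APPLICATION_CREDENTIALS environment variable, which is only one of the ways credentials can be provided. The helper name `credentials_path` is hypothetical and not part of the samples.

```python
import os


def credentials_path(environ=None):
    """Return the key-file path from GOOGLE_APPLICATION_CREDENTIALS, or None.

    Hypothetical helper: it only checks that the variable is set and that
    the file it names exists. It does not validate the key itself.
    """
    environ = os.environ if environ is None else environ
    path = environ.get("GOOGLE_APPLICATION_CREDENTIALS")
    if path and os.path.isfile(path):
        return path
    return None
```

Calling `credentials_path()` with no arguments inspects the current process environment. A result of `None` means only that this particular mechanism is not configured.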
Clone python-docs-samples and change directory to the sample directory you want to use.
$ git clone https://github.com/GoogleCloudPlatform/python-docs-samples.git
Install pip and virtualenv if you do not already have them. You may want to refer to the Python Development Environment Setup Guide for Google Cloud Platform for instructions.
Create a virtualenv. Samples are compatible with Python 2.7 and 3.4+.
$ virtualenv env
$ source env/bin/activate
Install the dependencies needed to run the samples.
$ pip install -r requirements.txt
To run this sample:
$ python quickstart.py
To run this sample:
$ python transcribe.py
usage: transcribe.py [-h] path
Google Cloud Speech API sample application using the REST API for batch
processing.
Example usage:
    python transcribe.py resources/audio.raw
    python transcribe.py gs://cloud-samples-tests/speech/brooklyn.flac
positional arguments:
  path        File or GCS path for audio file to be recognized
optional arguments:
  -h, --help  show this help message and exit
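The `path` argument accepts either a local file or a Cloud Storage location. A minimal sketch of how a script could tell the two apart, assuming that a `gs://` prefix is the only signal; the helper `audio_source` is hypothetical, and the sample's own dispatch logic may differ.

```python
def audio_source(path):
    """Classify a command-line path as a Cloud Storage URI or a local file.

    Returns a (kind, value) tuple where kind is "uri" or "file".
    Hypothetical helper for illustration; not taken from transcribe.py.
    """
    if path.startswith("gs://"):
        return ("uri", path)
    return ("file", path)
```

With this split, a caller could send the URI to the API as a reference and read local files into memory before sending their contents.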
To run this sample:
$ python transcribe_async.py
usage: transcribe_async.py [-h] path
Google Cloud Speech API sample application using the REST API for async
batch processing.
Example usage:
    python transcribe_async.py resources/audio.raw
    python transcribe_async.py gs://cloud-samples-tests/speech/vr.flac
positional arguments:
  path        File or GCS path for audio file to be recognized
optional arguments:
  -h, --help  show this help message and exit
To run this sample:
$ python transcribe_word_time_offsets.py
usage: transcribe_word_time_offsets.py [-h] path
Google Cloud Speech API sample that demonstrates word time offsets.
Example usage:
    python transcribe_word_time_offsets.py resources/audio.raw
    python transcribe_word_time_offsets.py gs://cloud-samples-tests/speech/vr.flac
positional arguments:
  path        File or GCS path for audio file to be recognized
optional arguments:
  -h, --help  show this help message and exit
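Word time offsets mark when each recognized word starts and ends. The sketch below shows one way to turn such offsets into readable seconds, assuming each offset exposes `seconds` and `nanos` fields in the style of a protobuf Duration. That shape is an assumption about the installed client library version; some versions may return a different type. The `Duration` stand-in and both helper names are hypothetical.

```python
from collections import namedtuple

# Stand-in for a protobuf-style duration; an assumption for illustration only.
Duration = namedtuple("Duration", ["seconds", "nanos"])


def to_seconds(duration):
    """Convert a seconds/nanos pair into fractional seconds."""
    return duration.seconds + duration.nanos * 1e-9


def format_word(word, start, end):
    """Render one word with its start and end offsets, to 0.1 s precision."""
    return "{}: {:.1f}s to {:.1f}s".format(
        word, to_seconds(start), to_seconds(end)
    )
```

For example, `format_word("hello", Duration(0, 0), Duration(1, 300000000))` produces `hello: 0.0s to 1.3s`.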
To run this sample:
$ python transcribe_streaming.py
usage: transcribe_streaming.py [-h] stream
Google Cloud Speech API sample application using the streaming API.
Example usage:
    python transcribe_streaming.py resources/audio.raw
positional arguments:
  stream      File to stream to the API
optional arguments:
  -h, --help  show this help message and exit
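Streaming recognition sends audio in pieces rather than as one payload. The sketch below shows only the chunking step, using the standard library; the generator name `read_chunks` and the 4096-byte chunk size are assumptions, and the sample itself may split the audio differently.

```python
import io


def read_chunks(stream, chunk_size=4096):
    """Yield successive byte chunks from a binary file-like object.

    Stops when the stream is exhausted; the last chunk may be shorter.
    """
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        yield chunk


# Usage with an in-memory stream standing in for an audio file.
fake_audio = io.BytesIO(b"\x00" * 10000)
chunk_sizes = [len(chunk) for chunk in read_chunks(fake_audio)]
```

A real script would open the file named by the `stream` argument in binary mode and wrap each chunk in a streaming request.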
To run this sample:
$ python transcribe_enhanced_model.py
usage: transcribe_enhanced_model.py [-h] path
Google Cloud Speech API sample that demonstrates enhanced models
and recognition metadata.
Example usage:
    python transcribe_enhanced_model.py resources/commercial_mono.wav
positional arguments:
  path        File to stream to the API
optional arguments:
  -h, --help  show this help message and exit
To run this sample:
$ python transcribe_auto_punctuation.py
usage: transcribe_auto_punctuation.py [-h] path
Google Cloud Speech API sample that demonstrates auto punctuation
and recognition metadata.
Example usage:
    python transcribe_auto_punctuation.py resources/commercial_mono.wav
positional arguments:
  path        File to stream to the API
optional arguments:
  -h, --help  show this help message and exit
To run this sample:
$ python transcribe_model_selection.py
usage: transcribe_model_selection.py [-h]
                                     [--model {command_and_search,phone_call,video,default}]
                                     path
Google Cloud Speech API sample that demonstrates how to select the model
used for speech recognition.
Example usage:
    python transcribe_model_selection.py resources/Google_Gnome.wav --model video
    python transcribe_model_selection.py gs://cloud-samples-tests/speech/Google_Gnome.wav --model video
positional arguments:
  path                  File or GCS path for audio file to be recognized
optional arguments:
  -h, --help            show this help message and exit
  --model {command_and_search,phone_call,video,default}
                        The speech recognition model to use
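The help text above corresponds to an argument parser with one positional argument and one restricted option. A minimal sketch of such a parser follows, assuming `default` is the fallback when `--model` is omitted; the help output does not state the fallback, and `build_parser` is a hypothetical name.

```python
import argparse

# Model names copied from the help output above.
MODELS = ["command_and_search", "phone_call", "video", "default"]


def build_parser():
    """Build a parser that mirrors the documented command line."""
    parser = argparse.ArgumentParser(
        description="Select the model used for speech recognition."
    )
    parser.add_argument(
        "path", help="File or GCS path for audio file to be recognized"
    )
    parser.add_argument(
        "--model",
        choices=MODELS,
        default="default",  # Assumed fallback; not stated in the help text.
        help="The speech recognition model to use",
    )
    return parser
```

Because `choices` is set, argparse rejects any model name outside the list before the script makes a request.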
To run this sample:
$ python beta_snippets.py
usage: beta_snippets.py [-h] command
Google Cloud Speech API sample that demonstrates enhanced models
and recognition metadata.
Example usage:
    python beta_snippets.py enhanced-model
    python beta_snippets.py metadata
    python beta_snippets.py punctuation
    python beta_snippets.py diarization
    python beta_snippets.py multi-channel
    python beta_snippets.py multi-language
    python beta_snippets.py word-level-conf
positional arguments:
  command
optional arguments:
  -h, --help  show this help message and exit
These samples use the Google Cloud Client Library for Python. Read the documentation for more details on API usage, and use GitHub to browse the source and report issues.