Skip to content

Commit

Permalink
Updated speech commands example to use new dataset
Browse files Browse the repository at this point in the history
PiperOrigin-RevId: 196557132
  • Loading branch information
petewarden authored and tensorflower-gardener committed May 14, 2018
1 parent e528e5a commit 52cb159
Show file tree
Hide file tree
Showing 2 changed files with 10 additions and 8 deletions.
16 changes: 9 additions & 7 deletions tensorflow/docs_src/tutorials/audio_recognition.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,13 +25,15 @@ python tensorflow/examples/speech_commands/train.py
```

The script will start off by downloading the [Speech Commands
dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.01.tar.gz),
which consists of 65,000 WAVE audio files of people saying thirty different
words. This data was collected by Google and released under a CC BY license, and
you can help improve it by [contributing five minutes of your own
dataset](https://storage.cloud.google.com/download.tensorflow.org/data/speech_commands_v0.02.tar.gz),
which consists of over 105,000 WAVE audio files of people saying thirty
different words. This data was collected by Google and released under a CC BY
license, and you can help improve it by [contributing five minutes of your own
voice](https://aiyprojects.withgoogle.com/open_speech_recording). The archive is
over 1GB, so this part may take a while, but you should see progress logs, and
once it's been downloaded once you won't need to do this step again.
over 2GB, so this part may take a while, but you should see progress logs, and
once it's been downloaded once you won't need to do this step again. You can
find more information about this dataset in this
[Speech Commands paper](https://arxiv.org/abs/1804.03209).

Once the downloading has completed, you'll see logging information that looks
like this:
Expand Down Expand Up @@ -229,7 +231,7 @@ You can also build this application yourself, since it's open source and
[available as part of the TensorFlow repository on
github](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#building-in-android-studio-using-the-tensorflow-aar-from-jcenter).
By default it downloads [a pretrained model from
tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.01.zip),
tensorflow.org](http://download.tensorflow.org/models/speech_commands_v0.02.zip),
but you can easily [replace it with a model you've trained
yourself](https://github.com/tensorflow/tensorflow/tree/master/tensorflow/examples/android#install-model-files-optional).
If you do this, you'll need to make sure that the constants in [the main
Expand Down
2 changes: 1 addition & 1 deletion tensorflow/examples/speech_commands/train.py
Original file line number Diff line number Diff line change
Expand Up @@ -288,7 +288,7 @@ def main(_):
'--data_url',
type=str,
# pylint: disable=line-too-long
default='http://download.tensorflow.org/data/speech_commands_v0.01.tar.gz',
default='http://download.tensorflow.org/data/speech_commands_v0.02.tar.gz',
# pylint: enable=line-too-long
help='Location of speech training data archive on the web.')
parser.add_argument(
Expand Down

0 comments on commit 52cb159

Please sign in to comment.