Skip to content

Commit

Permalink
update pre-trained models with audio demos
Browse files Browse the repository at this point in the history
  • Loading branch information
keonlee9420 committed Oct 17, 2021
1 parent 84fad6f commit f9161cf
Show file tree
Hide file tree
Showing 107 changed files with 9 additions and 6 deletions.
7 changes: 5 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,6 +6,9 @@ PyTorch Implementation of [PortaSpeech: Portable and High-Quality Generative Tex
<img src="img/model.png" width="80%">
</p>

## Audio Samples
Audio samples are available at [/demo](https://github.com/keonlee9420/PortaSpeech/tree/main/demo).

## Model Size
| Module | Normal | Small | Normal (paper) | Small (paper) |
| :----- | :-----: | :-----: | :-----: | :-----: |
Expand Down Expand Up @@ -111,11 +114,11 @@ tensorboard --logdir output/log
```

to serve TensorBoard on your localhost.
<!-- The loss curves, synthesized mel-spectrograms, and audios are shown.
The loss curves, synthesized mel-spectrograms, and audios are shown.

![](./img/tensorboard_loss.png)
![](./img/tensorboard_spec.png)
![](./img/tensorboard_audio.png) -->
![](./img/tensorboard_audio.png)

# Notes

Expand Down
2 changes: 1 addition & 1 deletion config/LJSpeech/model_small.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@ variational_generator:
encoder_layer: 8
decoder_layer: 3
conv_kernel_size: 3
conv_stride_size: 4
conv_stride_size: 2
encoder_decoder_hidden: 128
# encoder_decoder_dropout: 0.2
latent_hidden: 16
Expand Down
4 changes: 2 additions & 2 deletions config/LJSpeech/preprocess.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -12,8 +12,8 @@ preprocessing:
text:
text_cleaners: ["english_cleaners"]
language: "en"
sub_divide_word: True
max_phoneme_num: 7
sub_divide_word: True # subdivide long words into smaller pieces
max_phoneme_num: 7 # threshold for subdivision
audio:
sampling_rate: 22050
max_wav_value: 32768.0
Expand Down
2 changes: 1 addition & 1 deletion config/LJSpeech/train.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -15,7 +15,7 @@ optimizer:
grad_clip_thresh: 1.0
grad_acc_step: 1
warm_up_step: 4000
anneal_steps: [100000, 200000, 300000]
anneal_steps: [175000, 250000, 300000]
anneal_rate: 0.3
step:
total_step: 500000
Expand Down
Binary file added demo/LJSpeech/normal/100k/LJ001-0165.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ001-0165.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ001-0174.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ001-0174.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ002-0002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ002-0002.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ011-0269.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ011-0269.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ017-0025.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ017-0025.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ028-0403.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ028-0403.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/100k/LJ045-0141.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/100k/LJ045-0141.wav
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ001-0165.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ001-0165.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ001-0174.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ001-0174.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ002-0002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ002-0002.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ011-0269.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ011-0269.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ017-0025.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ017-0025.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ028-0403.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ028-0403.wav
Binary file not shown.
Binary file added demo/LJSpeech/normal/125k/LJ045-0141.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/normal/125k/LJ045-0141.wav
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ001-0165.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/small/150k/LJ001-0165.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ001-0174.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/small/150k/LJ001-0174.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ002-0002.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/small/150k/LJ002-0002.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ011-0269.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/small/150k/LJ011-0269.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ017-0025.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added demo/LJSpeech/small/150k/LJ017-0025.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ028-0403.png
Binary file added demo/LJSpeech/small/150k/LJ028-0403.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/150k/LJ045-0141.png
Binary file added demo/LJSpeech/small/150k/LJ045-0141.wav
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ001-0165.png
Binary file added demo/LJSpeech/small/175k/LJ001-0165.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ001-0174.png
Binary file added demo/LJSpeech/small/175k/LJ001-0174.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ002-0002.png
Binary file added demo/LJSpeech/small/175k/LJ002-0002.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ011-0269.png
Binary file added demo/LJSpeech/small/175k/LJ011-0269.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ017-0025.png
Binary file added demo/LJSpeech/small/175k/LJ017-0025.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ028-0403.png
Binary file added demo/LJSpeech/small/175k/LJ028-0403.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/175k/LJ045-0141.png
Binary file added demo/LJSpeech/small/175k/LJ045-0141.wav
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ001-0165.png
Binary file added demo/LJSpeech/small/200k/LJ001-0165.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ001-0174.png
Binary file added demo/LJSpeech/small/200k/LJ001-0174.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ002-0002.png
Binary file added demo/LJSpeech/small/200k/LJ002-0002.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ011-0269.png
Binary file added demo/LJSpeech/small/200k/LJ011-0269.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ017-0025.png
Binary file added demo/LJSpeech/small/200k/LJ017-0025.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ028-0403.png
Binary file added demo/LJSpeech/small/200k/LJ028-0403.wav
Binary file not shown.
Binary file added demo/LJSpeech/small/200k/LJ045-0141.png
Binary file added demo/LJSpeech/small/200k/LJ045-0141.wav
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file added img/tensorboard_audio.png
Binary file added img/tensorboard_loss.png
Binary file added img/tensorboard_spec.png

0 comments on commit f9161cf

Please sign in to comment.