Trying out some implementations of GANs for audio in Keras.
WaveGAN is an architecture that works directly on raw audio waveforms.
- Generator structure
- Discriminator structure
- Loss function: WGAN-GP (currently experiencing some issues)
- Train function
- Datasets:
  - Speech Commands Zero through Nine (SC09). Source: here
  - Drum sound effects. Source: here
  - Bach piano performances. Source: here
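
Since the WGAN-GP loss listed above is the part still giving trouble, here is a minimal sketch of how it is usually written with TensorFlow/Keras. It is not this repository's code: the function names (`gradient_penalty`, `critic_loss`, `generator_loss`), the `(batch, samples, channels)` input shape and the default `gp_weight=10.0` are assumptions made for illustration, and `critic` stands for any callable (for example a Keras model) that returns one score per example.

```python
import tensorflow as tf


def gradient_penalty(critic, real, fake):
    """WGAN-GP penalty: batch mean of (||grad critic(x_hat)||_2 - 1)^2."""
    batch_size = tf.shape(real)[0]
    # One interpolation coefficient per example, broadcast over time and channels.
    # Assumes float32 inputs of shape (batch, samples, channels).
    eps = tf.random.uniform([batch_size, 1, 1], 0.0, 1.0)
    x_hat = eps * real + (1.0 - eps) * fake
    with tf.GradientTape() as tape:
        tape.watch(x_hat)
        scores = critic(x_hat)
    # Examples are independent, so this yields the per-example input gradient.
    grads = tape.gradient(scores, x_hat)
    # L2 norm per example; the small constant avoids a NaN gradient at zero.
    norms = tf.sqrt(tf.reduce_sum(tf.square(grads), axis=[1, 2]) + 1e-12)
    return tf.reduce_mean(tf.square(norms - 1.0))


def critic_loss(critic, real, fake, gp_weight=10.0):
    """Critic (discriminator) loss: E[D(fake)] - E[D(real)] + weighted penalty."""
    wasserstein = tf.reduce_mean(critic(fake)) - tf.reduce_mean(critic(real))
    return wasserstein + gp_weight * gradient_penalty(critic, real, fake)


def generator_loss(critic, fake):
    """Generator loss: -E[D(fake)]."""
    return -tf.reduce_mean(critic(fake))
```

A common source of trouble with this loss is computing the penalty on the real or fake batch instead of on the interpolated samples, or using batch normalization in the critic, which couples the examples and invalidates the per-example gradient norm.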
Upcoming implementation