Experimenting with Deep Markov Models (DMMs) for character-level language modeling.
The results were underwhelming: DMMs are hard to stabilize during training, which makes it difficult to generate realistic text. The generated text is decent, but the model falls short of the results from [2], and even of standard LSTMs, in terms of NLL loss.
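For readers unfamiliar with the metric, the NLL comparison can be read as an average negative log-likelihood per character. Below is a minimal sketch of that calculation, not the code used in this repository: the function names and the example probabilities are hypothetical, and the loss actually computed in `main.py` may differ (a DMM trained with variational inference usually reports an ELBO-based bound on the NLL rather than the exact value).

```python
import math


def nll_per_char(probs):
    """Average negative log-likelihood (in nats) per character.

    `probs` holds the probability the model assigned to each character
    that actually occurred in the text (hypothetical input format).
    """
    return -sum(math.log(p) for p in probs) / len(probs)


def bits_per_char(probs):
    """Same quantity converted from nats to bits."""
    return nll_per_char(probs) / math.log(2)


# Made-up probabilities for a 4-character sequence, for illustration only.
example_probs = [0.5, 0.25, 0.5, 0.25]
print("NLL per character (nats):", nll_per_char(example_probs))
print("Bits per character:", bits_per_char(example_probs))
```

Lower is better on both scales. When comparing against numbers reported elsewhere, it is worth checking whether they are given in nats or bits and whether they are averaged per character or per sequence.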
Python 3.6
PyTorch 1.5.0
torchtext 0.6.0
pyro-ppl 1.3.1
To train an instance of a DMM with the default hyperparameters, run:
python3 main.py
