Skip to content

leetemil/thesis_code

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

Readme

Real readme coming soon. 2020/05/26 snapshot of code.

For now, find the Bayesian VAE and WaveNet reimplementations (in PyTorch) under src/models. Each model has a python script in src/ for running training instances with that model. For example

python run_vae.py --log_interval batch

from the src folder trains a newly initialized VAE model on default arguments (which can be viewed using the --help or -h flag), with logging information per batch update (basically a training progress bar). We currently get the best scores on VAE training 5 models, each with sparse interactions (dict flag) and randomly weighted samples.

python run_vae.py -dict -rws

Then, use the evaluate_ensemble.py script in src to make ensemble predictions over the trained models. This script takes one or more directories as argument and looks for model files (all files with .torch suffix) to include in the ensemble.

Data

Place aligned data files under src/data/files/alignments/. The alignment data used by default is from the DeepSequence repository, which can easily be downloaded and extracted using their script, as provided here as well. Navigate to src/data/files/ and run this script to download and extract alignment files:

curl -o alignments.tar.gz https://marks.hms.harvard.edu/deepsequence/alignments.tar.gz
tar -xvf alignments.tar.gz
rm alignments.tar.gz

About

Repository for the code used in our thesis.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages