avivt/VIN

Value Iteration Networks

Code for the NIPS 2016 paper:

Value Iteration Networks

Aviv Tamar, Yi Wu, Garrett Thomas, Sergey Levine, and Pieter Abbeel

UC Berkeley

Requires:

  • Python (2.7)
  • Theano (0.8)

For generating the gridworld data and visualizing results, also requires:

To start: the scripts directory contains scripts for generating the data and for training the different models.

scripts/make_data_gridworld_nips.m generates the training data (random grid worlds). Alternatively, you can skip this step and use the existing data files in the data folder.

scripts/nips_gridworld_experiments_VIN.sh shows how to train the VIN models.

After training, a weights file (e.g., /results/grid28_VIN.pk) will be created. You can then run:

  • script_viz_policy.m to run the trained VIN with the learned weights and view the trajectories it produces (line 17 of the script selects the weights file).
  • test_network.m to numerically evaluate the learned network on a test set (the test set must be generated first).
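
Before pointing the scripts above at a weights file, it can help to check what the file contains. The following is a minimal sketch, not part of this repository: it assumes the .pk file is a standard Python pickle (suggested by the extension, but not confirmed here), and the helper name summarize_weights is hypothetical. It is written for Python 3; under Python 2.7, drop the encoding argument.

```python
import pickle


def summarize_weights(path):
    """Load a pickled weights file and return a (type name, shape) pair per entry.

    Assumption: the file is a plain pickle. Only load files you trust,
    since unpickling can execute arbitrary code.
    """
    with open(path, "rb") as f:
        # encoding="latin1" lets Python 3 read NumPy arrays pickled under Python 2.
        weights = pickle.load(f, encoding="latin1")
    # Treat any object that is not a list or tuple as a single entry.
    entries = weights if isinstance(weights, (list, tuple)) else [weights]
    # Objects without a .shape attribute (e.g. plain numbers) report None.
    return [(type(w).__name__, getattr(w, "shape", None)) for w in entries]
```

Calling summarize_weights on the path of your trained weights file returns one pair per stored object, which is enough to confirm that training produced a non-empty file with the expected number of parameter arrays.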

Related implementations:

Kent Sommer's implementation of VINs (including data generation) in Python + PyTorch

https://github.com/kentsommer/pytorch-value-iteration-networks

Abhishek Kumar's implementation of VINs in TensorFlow

https://github.com/TheAbhiKumar/tensorflow-value-iteration-networks
