Meta Reinforcement Learning with Backpropamine

Jin Yeom

This project reimplements Backpropamine: training self-modifying neural networks with differentiable neuromodulated plasticity (Miconi et al., 2018), presented at ICLR 2019. On top of the original experiments, this implementation adds the following (a sketch of the resulting plastic layer is given after the list):

  • layer normalization for better performance
  • non-recurrent Backpropamine agent
  • fully observable state (to verify that recurrence is not necessary)
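
The core of Backpropamine is a layer whose effective weights are the sum of a fixed component and a Hebbian trace gated by a learned scalar neuromodulatory signal. The sketch below is a minimal PyTorch rendition of the simple-neuromodulation rule from the paper, with the layer normalization listed above applied to the pre-activation; the class and attribute names (PlasticLayer, init_hebb, the layer-norm placement) are illustrative and may not match the code in this repository.

    import torch
    import torch.nn as nn

    class PlasticLayer(nn.Module):
        """Neuromodulated plastic layer (simple neuromodulation), sketched
        after Miconi et al. (2018); details here are illustrative."""

        def __init__(self, in_features, out_features):
            super().__init__()
            self.w = nn.Linear(in_features, out_features)     # fixed (slowly learned) weights
            self.alpha = nn.Parameter(0.01 * torch.randn(in_features, out_features))  # plasticity coefficients
            self.modulator = nn.Linear(out_features, 1)        # produces the scalar signal M(t)
            self.norm = nn.LayerNorm(out_features)             # the layer-norm addition listed above (assumed placement)

        def init_hebb(self, batch_size):
            # The Hebbian trace starts at zero for each new lifetime.
            return torch.zeros(batch_size, self.w.in_features, self.w.out_features)

        def forward(self, x, hebb):
            # x: (batch, in_features); hebb: (batch, in_features, out_features)
            plastic = torch.bmm(x.unsqueeze(1), self.alpha * hebb).squeeze(1)
            y = torch.tanh(self.norm(self.w(x) + plastic))

            # The scalar neuromodulatory signal gates the Hebbian update;
            # the trace is clamped to stay bounded over a long lifetime.
            m = torch.tanh(self.modulator(y)).unsqueeze(2)     # (batch, 1, 1)
            outer = torch.bmm(x.unsqueeze(2), y.unsqueeze(1))  # (batch, in, out)
            hebb = torch.clamp(hebb + m * outer, -1.0, 1.0)
            return y, hebb

In a non-recurrent agent built from such layers, the Hebbian trace is the only state carried across time steps, which is what the fully-observable experiment is meant to probe.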

Currently, the major limitation keeping this project from moving forward is that meta reinforcement learning is computationally expensive: a recurrent network (or, in our case, a plastic network) must be differentiated through an entire lifetime spanning multiple trials, as the training-loop sketch below illustrates. Viewed from a different angle, however, this limitation also suggests a future direction for this project.
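
The cost shows up in the training loop: the loss for one lifetime is only computed after many trials, and backpropagation must unroll through every plastic update along the way. A rough sketch follows; PlasticAgent, a2c_loss, and the environment interface are placeholders, not this repository's API.

    import torch

    agent = PlasticAgent(obs_dim, num_actions)             # placeholder model
    optimizer = torch.optim.Adam(agent.parameters(), lr=1e-4)

    for episode in range(num_episodes):
        hebb = agent.init_hebb(batch_size)                  # plastic state persists across trials
        log_probs, values, rewards = [], [], []

        for trial in range(trials_per_lifetime):
            obs, done = env.reset(), False
            while not done:
                action, log_prob, value, hebb = agent(obs, hebb)
                obs, reward, done, _ = env.step(action)
                log_probs.append(log_prob)
                values.append(value)
                rewards.append(reward)

        # One actor-critic loss over the whole lifetime; backward() unrolls
        # through every Hebbian update, which is what makes this expensive.
        loss = a2c_loss(log_probs, values, rewards)         # placeholder helper
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()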

Todo

  • Retroactive neuromodulation with an eligibility trace (see the sketch after this list)
  • Plastic LSTM
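
For the first Todo item, the retroactive variant described in the paper replaces the instantaneous outer product with an exponentially decaying eligibility trace, which the (possibly delayed) neuromodulatory signal then converts into an actual weight change. A sketch of what the per-step update might look like; the function name and decay rate eta are illustrative.

    import torch

    def retroactive_update(hebb, elig, x, y, m, eta=0.1):
        # x: (batch, in), y: (batch, out), m: (batch, 1, 1)
        # The eligibility trace accumulates recent pre/post activity...
        outer = torch.bmm(x.unsqueeze(2), y.unsqueeze(1))   # (batch, in, out)
        elig = (1.0 - eta) * elig + eta * outer
        # ...and the neuromodulatory signal m gates how much of it is
        # written into the Hebbian trace.
        hebb = torch.clamp(hebb + m * elig, -1.0, 1.0)
        return hebb, elig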
