README

Codebase for It Takes Four to Tango: Multiagent Self Play for Automatic Curriculum Generation. Built off of the Pytorch SAC implementation at https://github.com/denisyarats/pytorch_sac.

Installation Instructions

For CUDA:

sudo apt-get install -y libglew-dev
Add to .bash_rc export LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libGLEW.so
install cuda https://pytorch.org/get-started/locally/

Setup repository

Install conda for creating the virtual env: https://docs.conda.io/en/latest/miniconda.html
Install Mujoco: https://www.roboti.us/index.html
Install any necessary packages for DMC: https://github.com/deepmind/dm_control
1. for Mujoco-py: sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3, sudo apt-get install patchelf.
Install dmc2gym: https://github.com/denisyarats/dmc2gym
Clone the repository and create the conda env using conda env create -f environment.yml and activate the env
Install local version of dm_control for modified envs: pip install -e dm_control/

Training

To train CuSP, run

python train.py env_name=point_mass exp_name=test num_steps=6e3 goal_algo=cusp seed=0 num_steps_alice=100 num_steps_bob=100 symmetrize=True before_update_stale_regrets=50 stale_regret_coeff=.9

Command	Description
`env_name`	Specify env to run -- `point_mass, point_mass_maze0, manipulator_reach, manipulator_toss, walker`
`num_steps`	Total number of training rounds
`goal_algo`	Goal generation algorithm -- `cusp, asp, goalgan, dr`
`num_steps_alice`	Max Alice trajectory length
`num_steps_bob`	Max Bob trajectory length
`symmetrize`	If true, ymmetrize training setup to have two goal generators
`before_update_stale_regrets`	Training episode at which we begin stale regret updates
`stale_regret_coeff`	Weighing (beta) of regret updates

For detailed configs, see config/train.yaml. Results will be logged in logdir/.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
agents		agents
config		config
dm_control		dm_control
envs		envs
utils		utils
.gitignore		.gitignore
README.md		README.md
baselines.py		baselines.py
cusp.py		cusp.py
environment.yml		environment.yml
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

README

Installation Instructions

For CUDA:

Setup repository

Training

About

Releases

Packages

Languages

yuqingd/cusp

Folders and files

Latest commit

History

Repository files navigation

README

Installation Instructions

For CUDA:

Setup repository

Training

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages