Skip to content

Commit

Permalink
open-source AlgaeDICE
Browse files Browse the repository at this point in the history
PiperOrigin-RevId: 290124611
  • Loading branch information
ofirnachum authored and copybara-github committed Jan 16, 2020
1 parent 9520097 commit 65f5543
Show file tree
Hide file tree
Showing 10 changed files with 1,023 additions and 0 deletions.
1 change: 1 addition & 0 deletions .travis.yml
Original file line number Diff line number Diff line change
Expand Up @@ -10,6 +10,7 @@ env:
matrix:
- PROJECT="abps"
- PROJECT="albert"
- PROJECT="algae_dice"
- PROJECT="attribution"
- PROJECT="axial"
- PROJECT="bam"
Expand Down
28 changes: 28 additions & 0 deletions algae_dice/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,28 @@
# AlgaeDICE

Code for AlgaeDICE as described in `AlgaeDICE: Policy Gradient from Arbitrary
Experience' by Ofir Nachum, Bo Dai, Ilya Kostrikov, Yinlam Chow, Lihong Li, and
Dale Schuurmans.

Paper available on arXiv [here](https://arxiv.org/abs/1912.02074).

If you use this codebase for your research, please cite the paper:

```
@article{nachum2019algaedice,
title={AlgaeDICE: Policy Gradient from Arbitrary Experience},
author={Nachum, Ofir and Dai, Bo and Kostrikov, Ilya and Chow, Yinlam and
Li, Lihong and Schuurmans, Dale},
journal={arXiv preprint arXiv:1912.02074},
year={2019}
}
```

## Basic Commands

Run AlgaeDICE on HalfCheetah:

```
python -m algae_dice.train_eval --logtostderr --save_dir=$HOME/algae/ \
--env_name=HalfCheetah-v2 --seed=42
```
16 changes: 16 additions & 0 deletions algae_dice/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
# coding=utf-8
# Copyright 2019 The Google Research Authors.
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.


Loading

0 comments on commit 65f5543

Please sign in to comment.