A fork of ddpg.py. Gets rid of the deterministic part of the deep deterministic policy gradient algorithm, and makes all the required changes to get it to work with the CartPole-v0 OpenAI gym environment.
This is just a reference for my CartPole actor-critic model here.
Dependencies:
So far:
- DDPG - Deep Deterministic Policy Gradients, evaluated on the Pendulum environment in OpenAI Gym