shivamsaboo17/Policy-Gradient-PyTorch

Policy-Gradient-PyTorch

Implementation of the vanilla stochastic (categorical) policy gradient algorithm to play CartPole.
Vanilla policy gradient takes longer to train than DQN on CartPole, but its convergence is smoother. Both properties are as expected.
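
For readers new to the method, here is a minimal, framework-free sketch of the core pieces of a vanilla (REINFORCE-style) policy gradient with a categorical policy: a softmax over action logits, sampling an action from it, discounted returns, and the loss that weights each log-probability by its return. This is not the repository's code. The function names and the discount factor `gamma=0.99` are assumptions chosen for illustration, and a real implementation would compute the logits with a PyTorch network and backpropagate through the loss.

```python
import math
import random


def softmax(logits):
    """Turn raw action scores into categorical probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(probs, rng=random):
    """Draw one action index from the categorical distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns, running = [], 0.0
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def reinforce_loss(log_probs, returns):
    """Policy gradient loss: minus the sum of log pi(a_t | s_t) * G_t."""
    return -sum(lp * g for lp, g in zip(log_probs, returns))
```

In CartPole there are two actions (push left, push right), so the policy would output two logits per state. Over one episode, the log-probability of each sampled action and the per-step rewards would be collected, then passed to `discounted_returns` and `reinforce_loss` before a gradient step.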
