Policy-Gradient-PyTorch

Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.
Vanilla policy gradient takes longer but convergence is smoother than DQN for the cartpole, both of these properties as expected.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Figure_1.png		Figure_1.png
README.md		README.md
vpg_pytorch.py		vpg_pytorch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Policy-Gradient-PyTorch

About

Releases

Packages

Languages

shivamsaboo17/Policy-Gradient-PyTorch

Folders and files

Latest commit

History

Repository files navigation

Policy-Gradient-PyTorch

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages