Pong-learning-agent

The learning agent does not actually use a Markov decision process, because the initial condition is constant. With a reasonable combination of alpha, gamma, and epsilon, the learning agent is able to bounce the ball 15 times.
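The README does not say which learning algorithm the agent uses. As a rough illustration of how alpha, gamma, and epsilon typically interact, here is a minimal sketch that assumes tabular Q-learning with epsilon-greedy exploration. Under that assumption alpha is the learning rate, gamma is the discount factor, and epsilon is the exploration probability. The names `ACTIONS`, `choose_action`, and `q_update`, as well as the state labels, are hypothetical and are not taken from the repository's simulator.

```python
import random
from collections import defaultdict

# Hypothetical paddle moves: -1 = up, 0 = stay, 1 = down.
ACTIONS = (-1, 0, 1)


def choose_action(q, state, epsilon, rng=random):
    """Epsilon-greedy: explore with probability epsilon, otherwise act greedily."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def q_update(q, state, action, reward, next_state, alpha, gamma):
    """Apply one tabular Q-learning update and return the new Q-value."""
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Q-table that defaults every unseen (state, action) pair to 0.0.
q = defaultdict(float)
q_update(q, "s0", 1, reward=1.0, next_state="s1", alpha=0.5, gamma=0.9)
```

In a sketch like this, a larger alpha makes each bounce reward change the table faster, a gamma closer to 1 gives more weight to future bounces, and a small epsilon keeps some exploration while mostly following the learned policy.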

Repository contents: simulator, README.md
Kenchmo/Pong-learning-agent