about starting a new game and History #59

Open
Richardxxxxxxx opened this issue Sep 12, 2018 · 1 comment

Comments

@Richardxxxxxxx

In dqn/agent.py, line 59:

  if terminal:
    screen, reward, action, terminal = self.env.new_random_game()

When a new game starts due to a terminal state, why don't we need to reset self.history?

It seems to me that not resetting it would affect the next iteration:

  # 1. predict
  action = self.predict(self.history.get())
  # 2. act
  screen, reward, terminal = self.env.act(action, is_training=True)
  # 3. observe
  self.observe(screen, reward, action, terminal)

The action predicted from self.history.get() does not depend on the current game's screens; instead, it is predicted from the screens of the previous game, which has already ended.
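For example, if we did want to reset, I imagine something like this would work (just a sketch; it assumes self.history_length is available on the agent and that History.add() pushes one frame into the buffer, as in dqn/history.py):

  if terminal:
    screen, reward, action, terminal = self.env.new_random_game()
    # hypothetical fix: flood the buffer with the new game's first
    # screen so the next predict() sees no frames from the old game
    for _ in range(self.history_length):
      self.history.add(screen)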

Am I missing anything?

Thank you very much.

@hipoglucido

Yeah, it would affect the next iteration, but it won't do any harm in most cases. In many RL environments the concept of an episode/game is abstracted away from the agent, and all it sees is a continuous flow of millions of frames.
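For a rough sense of scale (hypothetical numbers, assuming history_length = 4 as in the usual DQN setup):

  history_length = 4     # frames stacked per state
  episode_length = 1000  # hypothetical average episode length in steps

  # only the first (history_length - 1) predictions after a game ends
  # can mix frames from the previous game
  contaminated = history_length - 1
  print("%.1f%% of decisions per episode" % (100.0 * contaminated / episode_length))
  # -> 0.3% of decisions per episode

So the stale frames only touch a tiny fraction of the agent's decisions, and training washes it out.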
