NaNs output from the policy network #6

stevenbinhu21 · 2018-05-10T04:23:40Z

Hi, I was training the model for one of the minigames. And after about 60000 episodes, the policy for the actions probabilities output all NaNs, I haven't been able to track down the problem yet, as it takes long to get to that point for debugging. Just wonder if you have encountered such problem before?

simonmeister · 2018-05-10T09:58:26Z

Which of the games are you training? Yes, we also encountered it occasionally in DefeatRoaches and DefeatZerglingsAndBanelings (if I remember correctly), however it only was for some runs and I also wasn't able to identify the source of the problem.

jaejaywoo · 2018-06-21T14:08:57Z

I am also encountering NaN while running DefeatZerglingsAndBanelings and CollectMineralShards as well. Is there any updates on this issue?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NaNs output from the policy network #6

NaNs output from the policy network #6

stevenbinhu21 commented May 10, 2018

simonmeister commented May 10, 2018 •

edited

jaejaywoo commented Jun 21, 2018

NaNs output from the policy network #6

NaNs output from the policy network #6

Comments

stevenbinhu21 commented May 10, 2018

simonmeister commented May 10, 2018 • edited

jaejaywoo commented Jun 21, 2018

simonmeister commented May 10, 2018 •

edited