Getting 7 as action for a game with 3 actions #170

Open
arjung128 opened this issue Mar 13, 2021 · 1 comment

arjung128 commented Mar 13, 2021

I have been trying to train an online agent on the FreewayNoFrameskip-v4 environment. Because this Gym environment is not deterministic, I seeded it. Specifically, in atari_lib.py, I added:

  • env.seed(0) after env = gym.make(full_game_name) in create_atari_environment
  • self.environment.seed(0) at the end of the AtariPreprocessing class's __init__ function
  • self.environment.seed(0) at the start of the reset function in the AtariPreprocessing class

No other changes were made. I then used the Batch RL codebase to train an online agent.
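For readers who want the same seeding behavior without editing atari_lib.py, here is a minimal sketch of a wrapper that re-seeds on every reset. It is not part of Dopamine or the Batch RL codebase: `SeedOnReset` is a hypothetical name, and the sketch assumes an older Gym-style environment that exposes `seed()` and `reset()`.

```python
class SeedOnReset:
    """Hypothetical wrapper: seeds the wrapped env at construction and before every reset.

    Assumes the wrapped object exposes `seed(value)` and `reset()`,
    as older Gym environments do.
    """

    def __init__(self, env, seed=0):
        self.env = env
        self.seed_value = seed
        # Seed once right after creation.
        self.env.seed(self.seed_value)

    def reset(self):
        # Re-seed before each episode so every reset starts from the same RNG state.
        self.env.seed(self.seed_value)
        return self.env.reset()

    def __getattr__(self, name):
        # Only called when normal lookup fails; forward everything else
        # (step, action_space, ...) to the wrapped environment.
        if name == "env":
            raise AttributeError(name)
        return getattr(self.env, name)


# Usage sketch (assumes an older Gym version where env.seed exists):
# import gym
# env = SeedOnReset(gym.make("FreewayNoFrameskip-v4"), seed=0)
```

Note that re-seeding on every reset makes each episode start from the same random state, which may or may not be what is wanted for training.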

Across all of training, exactly one stored action had the value 7 (specifically, the last action at the end of five iterations of training), even though Freeway has only three actions. Every other stored action was in {0, 1, 2}. Any ideas what could cause this? Manually replacing this single 7 with the most common action isn't a problem, but if this recurs, or shows up in other games, it could be difficult to deal with.
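A quick way to check how often this happens is to scan the stored actions for values outside the valid range. The sketch below works on a plain Python list; the helper names and the sample data are hypothetical, and loading the actions from the actual replay-buffer files is left out.

```python
from collections import Counter


def find_invalid_actions(actions, num_actions):
    """Return (index, action) pairs whose action is outside [0, num_actions)."""
    return [(i, a) for i, a in enumerate(actions) if not 0 <= a < num_actions]


def replace_invalid_actions(actions, num_actions):
    """Return a copy where invalid actions are replaced by the most common valid one."""
    valid = [a for a in actions if 0 <= a < num_actions]
    if not valid:
        raise ValueError("no valid actions to infer a replacement from")
    fallback = Counter(valid).most_common(1)[0][0]
    return [a if 0 <= a < num_actions else fallback for a in actions]


# Hypothetical data for illustration: Freeway has 3 actions, so 7 is invalid.
stored_actions = [0, 1, 1, 2, 1, 7]
print(find_invalid_actions(stored_actions, num_actions=3))     # [(5, 7)]
print(replace_invalid_actions(stored_actions, num_actions=3))  # [0, 1, 1, 2, 1, 1]
```

Reporting the indices first, rather than silently patching, makes it easier to see whether the bad value always lands at the same position (for example, the last stored transition).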

Collaborator

psc-g commented May 7, 2021

Hi, is this still an issue? Are you sure you're reloading the correct checkpoint (for the same game)?
Otherwise, it might make more sense to ask in the batch rl repo.
