Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sonic A2C not working for Pong #48

Open
slerman12 opened this issue Mar 15, 2019 · 5 comments
Open

Sonic A2C not working for Pong #48

slerman12 opened this issue Mar 15, 2019 · 5 comments

Comments

@slerman12
Copy link

I'm trying to test whether the A2C code for Sonic could be used to train an agent on another environment. I replaced the Sonic environments with 8 copies of Pong, and I varied up the number of epochs and mini batches and nsteps, but no matter what, I could not get it to learn Pong. Is there a reason this implementation won't train on Pong? Am I missing some important parameter? Could you test it for yourself and let me know? All I had to do was change the environments in agent.py with a Pong make_env() that used frame stacking and preprocessing.

@slerman12 slerman12 changed the title Sonic A2c not working for Pong Sonic A2C not working for Pong Mar 15, 2019
@pengzhi1998
Copy link

Hi, how many episodes did you run? And may I know your total reward for each episode?

@slerman12
Copy link
Author

If I recall, 100 updates on the default settings was not enough to make any progress. The reward did not go up from -20 per episode.

@pengzhi1998
Copy link

pengzhi1998 commented Mar 26, 2019 via email

@slerman12
Copy link
Author

That surprises me, since the trained Sonic model required only 270 updates. That’s already processing millions of states, which should be enough for Pong, shouldn’t it?

@slerman12
Copy link
Author

I'll try to run 1000 updates and get back to you. What if it still doesn't play Pong then? I'm hoping to use this as a baseline for my research with transfer learning. Would you not recommend that?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants