After some steps, all the NNs always output same action #75

Open
Eify666666 opened this issue Feb 27, 2021 · 1 comment

Comments

@Eify666666

I've been training an A3C agent recently, but after some number of steps the network always takes the same action.
The game I'm training on is similar to Go: there is very little reward in the short term, so it is hard for the network to learn anything useful from the game. Maybe that is where the problem lies. I tried 'torch.nn.utils.clip_grad_norm(lnet.parameters(), 50)' and used ReLU as the activation function, but that doesn't help.
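
One way to confirm the symptom described above is to track the entropy of the policy's action distribution during training: if it falls to nearly zero, the policy has become almost deterministic. The sketch below uses only the standard library and is not taken from this repository. The helper names (`policy_entropy`, `looks_collapsed`), the 5% threshold, and the example logits are all hypothetical, chosen purely for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw action scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy_entropy(logits):
    """Shannon entropy (in nats) of the action distribution."""
    probs = softmax(logits)
    # Skip zero probabilities so that log(0) is never evaluated.
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def looks_collapsed(logits, threshold=0.05):
    """Hypothetical check: entropy below `threshold` times the maximum
    possible entropy, log(n_actions), is treated as a collapsed policy."""
    max_entropy = math.log(len(logits))
    return policy_entropy(logits) < threshold * max_entropy


# Made-up logits for a 4-action policy (assumption, not real training data).
healthy_logits = [0.1, 0.0, -0.1, 0.05]
collapsed_logits = [20.0, 0.0, 0.0, 0.0]

print(looks_collapsed(healthy_logits))
print(looks_collapsed(collapsed_logits))
```

If the entropy does collapse early, a commonly used remedy in A3C-style training is an entropy bonus in the loss that rewards keeping the action distribution spread out. Whether that helps with rewards as sparse as the ones described here is not something this sketch can show.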

@RuoyuG

RuoyuG commented Nov 4, 2022

I have the same problem; it looks like the policy is stuck in a local optimum. Did you solve it?
