Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

train_ppo_a2c_for_lunar_lander_continuous的ppo算法,好像不能完全复现曲线变化情况 #344

Open
lindefoe opened this issue Dec 28, 2023 · 0 comments

Comments

@lindefoe
Copy link

QQ图片20231228105937

train_ppo_a2c_for_lunar_lander_continuous的ppo算法,好像不能完全复现曲线变化情况。
如果想完全复现曲线情况,不知道需不需要env.seed(args.random_seed)呢?
但是我尝试加了下env.seed(args.random_seed),好像起的作用不是很多大。
曲线不能完全浮现,不知道是不是因为多线程原因呢?还是别的原因呢?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant