Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

extrinsic and intrinsic combination #44

Open
murtazabasu opened this issue Dec 23, 2019 · 1 comment
Open

extrinsic and intrinsic combination #44

murtazabasu opened this issue Dec 23, 2019 · 1 comment

Comments

@murtazabasu
Copy link

Hello, I am trying to implement ICM in PPO with both extrinsic and intrinsic combination. I have seen in few repos where they weight out an extrinsic reward more than intrinsic i.e. combine_reward = (1-int_coef) * rewards + int_coef * intrinsic_reward whereint_coeff = 0.01which reduces the effect of intrinsic rewards significantly. Seeing your paper, you have nowhere mentioned this sort of equation for both the rewards. I wonder if you can tell me that the equation mentioned above can be implemented for a dual reward setting.

@Joll123
Copy link

Joll123 commented May 24, 2020

Hello, do you understand the relationship between external rewards and internal rewards? how to adjust int_coef parameters.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants