Is this really an A3C implementation, and not just actor-critic? #8

Open
sevity opened this issue Nov 21, 2018 · 2 comments

sevity commented Nov 21, 2018

Where are the multi-step loss function and the entropy loss function?

KGBUSH commented Jul 17, 2020

> Where are the multi-step loss function and the entropy loss function?

For the entropy exploration term, check the loss_func function in continuous_A3C.py.
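For readers unfamiliar with the idea, here is a minimal sketch of how an entropy bonus typically enters an actor loss for a Gaussian policy. It is not copied from continuous_A3C.py: the function names `gaussian_entropy` and `actor_loss` and the weight `beta` are assumptions chosen for illustration, and plain floats are used instead of tensors.

```python
import math


def gaussian_entropy(sigma):
    """Differential entropy of a 1-D Gaussian with standard deviation sigma."""
    return 0.5 + 0.5 * math.log(2 * math.pi) + math.log(sigma)


def actor_loss(log_prob, advantage, sigma, beta=0.005):
    """Policy-gradient loss with an entropy bonus (hypothetical helper).

    log_prob:  log-probability of the action that was taken
    advantage: advantage estimate, treated as a constant
    sigma:     standard deviation of the Gaussian policy
    beta:      entropy weight (assumed value, not taken from the repo)
    """
    expected_value = log_prob * advantage + beta * gaussian_entropy(sigma)
    # Minimizing the negative maximizes both the expected return and the entropy.
    return -expected_value


# A wider policy (larger sigma) has higher entropy and so a lower loss.
narrow = actor_loss(log_prob=-1.0, advantage=2.0, sigma=0.1)
wide = actor_loss(log_prob=-1.0, advantage=2.0, sigma=1.0)
print(narrow, wide)
```

The entropy term rewards keeping the policy's standard deviation from collapsing too early, which is what encourages exploration.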

jakkarn commented May 4, 2021

This should be the if-condition in the Worker that determines when N actions have been taken (or the episode has terminated):

if total_step % UPDATE_GLOBAL_ITER == 0 or done: # update global and assign to local net

The N-step targets for the loss are then computed in push_and_pull, in this loop:

for r in br[::-1]: # reverse buffer r
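A minimal sketch of what a reversed loop like this usually computes, assuming `br` holds the rewards collected since the last update. The function name `n_step_targets`, the `bootstrap_value` argument, and the default `gamma` are assumptions for illustration, not the repo's actual signature.

```python
def n_step_targets(rewards, bootstrap_value, gamma=0.9):
    """Discounted N-step return targets for a buffer of rewards.

    rewards:         rewards from the last N steps, oldest first
    bootstrap_value: critic's estimate for the state after the last step,
                     or 0.0 if the episode terminated (assumed convention)
    gamma:           discount factor (assumed value)
    """
    targets = []
    running = bootstrap_value
    for r in rewards[::-1]:  # walk the buffer from newest to oldest
        running = r + gamma * running
        targets.append(running)
    targets.reverse()  # restore oldest-first order to match the states
    return targets


# Three steps with reward 1 each, episode terminated, gamma = 0.5
print(n_step_targets([1.0, 1.0, 1.0], bootstrap_value=0.0, gamma=0.5))
# -> [1.75, 1.5, 1.0]
```

Each target then serves as the regression target for the critic at the corresponding state, and target minus predicted value gives the advantage used in the actor loss.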
