How to access parameter values for each time-step with callback function policy_param? #468
blancamiller
started this conversation in
General
Replies: 1 comment
-
Hi @blancamiller , thanks for using brax! Could you please describe a bit more about your use-case? What do you mean by parameter values for each time-step? Parameters don't change during a rollout, they get updated over a batch of episodes. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am using the provided Brax Training tutorial for PPO + Ant. The notebook provides a user-defined callback function
progress
. Similarly you can specify apolicy_param
function and pass it intotrain
. This is where my issues lies. I’m able to get the parameter values for each episode, however, I’d like to access the parameters for each time-step. How can I go about doing that?Beta Was this translation helpful? Give feedback.
All reactions