Let's assume that I have already trained a model A that predicts A(x) given an observation x.
I would now like to train a second model B with PPO so that the combined output A(x) + B(x) is minimized.
This means that, at every environment step during training, the action actually applied must be the action predicted by the model being trained (B) plus the prediction of the already-trained model A for that same observation.
Is there a proper way to do so using SB3?
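To make the setup above concrete, here is a minimal, library-independent sketch of the idea: an environment wrapper that remembers the last observation and adds the frozen model's output to whatever action the model under training proposes. Everything in it is hypothetical and only for illustration (`ToyEnv`, `ResidualActionWrapper`, `frozen_a`); the `reset`/`step` return shapes merely mimic the Gymnasium convention, and this is not a confirmed SB3 recipe.

```python
class ToyEnv:
    """Hypothetical 1-D environment: the state moves by the applied action."""

    def __init__(self):
        self.state = 0.0

    def reset(self):
        self.state = 1.0
        return self.state, {}

    def step(self, action):
        self.state += action
        reward = -abs(self.state)  # closer to zero is better
        # (obs, reward, terminated, truncated, info), Gymnasium-style
        return self.state, reward, False, False, {}


class ResidualActionWrapper:
    """Adds a frozen base policy's output to every incoming action."""

    def __init__(self, env, base_policy):
        self.env = env
        self.base_policy = base_policy  # stands in for trained model A
        self._last_obs = None

    def reset(self):
        obs, info = self.env.reset()
        self._last_obs = obs  # needed to query A on the next step
        return obs, info

    def step(self, action):
        # `action` is B's output; the environment receives A(x) + B(x).
        combined = self.base_policy(self._last_obs) + action
        obs, reward, terminated, truncated, info = self.env.step(combined)
        self._last_obs = obs
        return obs, reward, terminated, truncated, info


def frozen_a(obs):
    """Hypothetical stand-in for the already-trained model A."""
    return -0.5 * obs


env = ResidualActionWrapper(ToyEnv(), frozen_a)
obs, _ = env.reset()
obs, reward, terminated, truncated, _ = env.step(0.25)  # 0.25 plays the role of B(x)
```

With this arrangement the model under training only ever sees and outputs its own action, while the environment receives the sum. Assuming a Gymnasium environment, the same pattern could presumably be written as a `gymnasium.Wrapper` subclass whose `step` calls `A.predict(obs, deterministic=True)` on the stored observation, with PPO then trained on the wrapped environment.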
Checklist
I have checked that there is no similar issue in the repo