Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training a model based on the prediction of another model (PPO) #1821

Open
4 tasks done
SamySam0 opened this issue Jan 31, 2024 · 1 comment
Open
4 tasks done

Training a model based on the prediction of another model (PPO) #1821

SamySam0 opened this issue Jan 31, 2024 · 1 comment
Labels
question Further information is requested

Comments

@SamySam0
Copy link

❓ Question

Let's assume that I have already trained model A to predict A(x) given observation x.
I would now like to train a model B using PPO to minimize A(x) + B(x).
This means that, when stepping during training, I would need to take the action predicted by the model under training, i.e. B, plus the prediction from A (already trained model) for that same observation.
Is there a proper way to do so using SB3?

Checklist

@SamySam0 SamySam0 added the question Further information is requested label Jan 31, 2024
@araffin
Copy link
Member

araffin commented Feb 2, 2024

Is there a proper way to do so using SB3?

You should have a look at gym wrappers/VecEnv wrapper (we have tutorials/examples in our doc).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants