Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

learn.py, expected performance, steps, and hardware? #177

Open
MatthewCWeston opened this issue Oct 22, 2023 · 2 comments
Open

learn.py, expected performance, steps, and hardware? #177

MatthewCWeston opened this issue Oct 22, 2023 · 2 comments
Labels
question Further information is requested

Comments

@MatthewCWeston
Copy link

Hello. I'm attempting to run learn.py on the hover test environment, and wondering if anyone has had any luck with this so far.

I admittedly haven't tried 1E12 training steps quite yet, but after 1E6 steps, my reward graph looks like this:

download

For reference, a dummy policy that always returns the vector [.1,.1,.1,.1] achieves a reward of roughly -450.

In practice, a typical evaluation run with this model looks like the path shown below:

image

I've tried both the standard, un-commented script, and the commented script adapted for the current versions of both this repository and SB3, and seen similar results. Does it simply require more timesteps, or more parallel CPUs/GPUs? It would be very helpful (and much appreciated) if someone could share the hardware configuration and loss curve associated with a successful run.

@MatthewCWeston MatthewCWeston changed the title learn.py, expected steps and hardware? learn.py, expected performance, steps, and hardware? Oct 22, 2023
@abdul-mannan-khan
Copy link

abdul-mannan-khan commented Nov 6, 2023

Same thing. I ran it for 20,000,000 steps. Still no success. I used the PPO algorithm. Just one question, @MatthewCWeston, how did you get this reward function? I am not able to get it? The previous functions (from the paper branch) are throwing many errors.

@JacopoPan JacopoPan added the question Further information is requested label Nov 19, 2023
@JacopoPan
Copy link
Member

See #180 and the current main

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants