[question] Tuning for GAIL and custom envs with time bottlenecks #92

prabhasak · 2020-07-08T07:41:44Z

Hello. I use SB and zoo actively for GAIL. My CustomEnv, built using AirSim, trains in (almost) real time, so I have spent months trying to find the right set of hyperparameters (HPs) for GAIL to imitate expert trajectories (generated from an optimal TRPO policy). I have some specific questions regarding TRPO and GAIL:

1. Since GAIL uses TRPO, I made a copy of the zoo TRPO HPs and called it GAIL. Can I do better? I have had luck imitating simple Gym envs with GAIL, but have had a hard time imitating MuJoCo envs.
2. CustomEnv training for 1e6 timesteps takes ~1.5 days, so I've been avoiding tuning. Would you recommend tuning for GAIL? Do I just copy the TRPO sampler for GAIL? Is there anything else I can do to speed up tuning?
3. With both a lack of tuned HPs and real-time training, is there any other avenue I can try to get GAIL to work on my CustomEnv?
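
To put the time bottleneck in numbers, here is a rough sketch that turns the ~1.5 days per 1e6 timesteps figure into a throughput and a tuning budget. It assumes throughput stays constant across runs and that trials split evenly across parallel simulator instances. The function `tuning_days` and the example search (20 trials of 1e5 timesteps) are hypothetical illustrations, not settings from the zoo.

```python
# Back-of-the-envelope tuning budget from the figures quoted above:
# 1e6 timesteps take roughly 1.5 days of wall-clock time.
SECONDS_PER_DAY = 24 * 60 * 60

timesteps_per_run = 1_000_000
days_per_run = 1.5

# Approximate environment throughput implied by those two numbers.
steps_per_second = timesteps_per_run / (days_per_run * SECONDS_PER_DAY)


def tuning_days(n_trials: int, timesteps_per_trial: int, n_parallel: int = 1) -> float:
    """Estimate wall-clock days for a hyperparameter search.

    Assumes constant throughput and an even split of trials across
    n_parallel simulator instances (an idealization).
    """
    total_steps = n_trials * timesteps_per_trial
    return total_steps / steps_per_second / SECONDS_PER_DAY / n_parallel


print(f"throughput: {steps_per_second:.1f} steps/s")
# Hypothetical search: 20 trials of 1e5 timesteps each, run one after another.
print(f"sequential search: {tuning_days(20, 100_000):.1f} days")
```

Under these assumptions, the cost of a search scales linearly with the number of trials and the per-trial budget, so shorter trials or parallel simulator instances are the main levers.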

Any help is greatly appreciated. Thank you for these awesome repos!

CustomEnv info:
Observations: 6-dimensional, continuous
Actions: 3-dimensional, continuous
Rewards: dense, with a large reward at the goal
