Expert Trajectories for Gym Environments

This repository contains expert trajectories for state-only imitation learning tasks in Gym environments.

What's State-Only Imitation Learning?

State-only imitation learning means mimicking the behavior of an expert using only observations of the environment's states, without access to the expert's actions. It's like learning to dance by watching someone else's moves without being told the exact steps they are taking.

How to Use

The trajectories are stored in the trajectories directory. Load them with np.load to use them for state-only imitation learning (SOIL).
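As a minimal sketch of what loading might look like: the file layout is not documented here, so the example assumes each file holds a fixed-length array of shape `(num_trajectories, horizon, state_dim)`. The helper names `load_trajectories` and `to_state_pairs` and the path in the usage note are hypothetical, not part of this repository.

```python
import numpy as np


def load_trajectories(path):
    """Load a saved trajectory array from a .npy file."""
    return np.load(path)


def to_state_pairs(trajectories):
    """Split states into (s_t, s_{t+1}) pairs.

    Assumes `trajectories` has shape (num_trajectories, horizon, state_dim);
    this layout is an assumption, so check `.shape` and `.dtype` after loading.
    """
    state_dim = trajectories.shape[-1]
    current_states = trajectories[:, :-1, :].reshape(-1, state_dim)
    next_states = trajectories[:, 1:, :].reshape(-1, state_dim)
    return current_states, next_states
```

A call such as `to_state_pairs(load_trajectories("trajectories/cartpole/states.npy"))` (hypothetical path) would then give the state transitions that state-only methods typically learn from. If the episodes have different lengths, the file may instead hold an object array, which needs `np.load(path, allow_pickle=True)` and per-trajectory handling.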

I have also provided the scripts used for collection in the scripts directory. You can change their parameters to collect more trajectories, adjust the reward threshold, or use a different algorithm.

Methods of Collection

| Environment | Algorithm | Details | Trajectories Collected |
| --- | --- | --- | --- |
| `CartPole-v1` | PPO | I collected trajectories that yielded a minimum reward of 475. This threshold is given in the official documentation of the environment. | 1000 |
| `Pendulum-v1` | DDPG | Pendulum is an unsolved environment, meaning it has no official reward threshold. I therefore first collected 1e6 trajectories from the trained agent, then kept only those whose reward was greater than `mean + standard_deviation` of all collected rewards. | 22 |
| `MountainCarContinuous-v0` | DDPG | I collected trajectories that yielded a minimum reward of -110. | 1000 |
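The Pendulum filtering rule described above can be sketched roughly as follows. This is an illustration, not the collection script itself: the function name `select_above_mean_plus_std` is hypothetical, and it assumes the population standard deviation (NumPy's default), which may differ from what the scripts use.

```python
import numpy as np


def select_above_mean_plus_std(episode_rewards):
    """Return indices of episodes whose reward exceeds mean + std, plus the threshold."""
    rewards = np.asarray(episode_rewards, dtype=float)
    # Assumption: population standard deviation (ddof=0), NumPy's default.
    threshold = rewards.mean() + rewards.std()
    keep = np.flatnonzero(rewards > threshold)
    return keep, threshold
```

Only the trajectories at the returned indices would be saved, which is why far fewer trajectories survive for Pendulum than for the environments with a fixed reward threshold.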
