Hindsight Experience Replay and Hierarchical Reinforcement Learning

Comp 781 Project https://github.com/ArmaanSethi/Hindsight-Experience-Replay-and-Hierarchical-Reinforcement-Learning

How to use Hindsight Experience Replay and Hierarchical Reinforcement Learning

First set up OpenAI Gym. There are many tutorials online that explain this better than I can. Then clone this repo and use the code I added to the baselines they provided.

Getting started

Training an agent is very simple:

python -m baselines.herhrl.experiment.train

This will train a DDPG+HER+HRL agent on the FetchReach environment. You should see the success rate go up quickly to 1.0, which means that the agent achieves the desired goal in all cases. The training script logs other diagnostics as well and pickles the best policy so far (with respect to its test success rate), the latest policy, and, if enabled, a history of policies every K epochs. To train on a different environment, use the --env_name flag, for example --env_name FetchPickAndPlace-v0.
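The checkpointing behaviour described above can be pictured roughly as follows. This is only a minimal sketch, not the actual baselines code: run_training, train_epoch, evaluate, and save_interval are hypothetical names, and the file names policy_latest.pkl and policy_<epoch>.pkl are assumptions (only policy_best.pkl appears in this README).

```python
import os
import pickle


def _save(obj, path):
    """Pickle obj to path, overwriting any existing file."""
    with open(path, "wb") as f:
        pickle.dump(obj, f)


def run_training(policy, train_epoch, evaluate, save_dir, n_epochs, save_interval=0):
    """Hypothetical training loop that keeps three kinds of checkpoints.

    train_epoch(policy) updates the policy in place.
    evaluate(policy) returns the test success rate in [0, 1].
    """
    os.makedirs(save_dir, exist_ok=True)
    best_success_rate = -1.0
    for epoch in range(n_epochs):
        train_epoch(policy)
        success_rate = evaluate(policy)

        # The latest policy is overwritten after every epoch.
        _save(policy, os.path.join(save_dir, "policy_latest.pkl"))

        # The best policy is overwritten only when the test success rate
        # matches or beats the best value seen so far.
        if success_rate >= best_success_rate:
            best_success_rate = success_rate
            _save(policy, os.path.join(save_dir, "policy_best.pkl"))

        # Optional history: one snapshot every save_interval epochs.
        if save_interval > 0 and epoch % save_interval == 0:
            _save(policy, os.path.join(save_dir, "policy_{}.pkl".format(epoch)))
    return best_success_rate
```

Keeping the best and the latest policy in separate files means a late drop in test success rate does not overwrite the best result.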

To inspect what the agent has learned, use the play script:

python -m baselines.herhrl.experiment.play /path/to/an/experiment/policy_best.pkl

You can try it right now with the results of the training step (the script prints out the path for you). This should visualize the current policy for 10 episodes and will also print statistics.

I used the following command:

python -m baselines.herhrl.experiment.train --num_cpu 2 --env_name FetchPush-v0 --n_epochs 200 --replay_strategy future
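For readers unfamiliar with the future replay strategy named in the command above, here is a minimal sketch of the idea behind it in plain Python. It is an illustration under simplifying assumptions, not the code in this repo: goals are single floats, the reward is a sparse 0 / -1 signal with a made-up tolerance, relabeling is done eagerly over a whole episode, and relabel_future, sparse_reward, and k are hypothetical names.

```python
import random


def sparse_reward(achieved_goal, goal, tol=0.05):
    """Return 0.0 if achieved_goal is within tol of goal, else -1.0.

    Goals are scalars here for brevity; tol is an assumed value.
    """
    return 0.0 if abs(achieved_goal - goal) <= tol else -1.0


def relabel_future(episode, k=4, rng=random):
    """Return the original transitions plus k hindsight copies per step.

    episode is a list of dicts with keys obs, action, achieved_goal
    (the goal reached after taking the action) and goal.
    """
    out = []
    last = len(episode) - 1
    for t, tr in enumerate(episode):
        # Keep the original transition with its real goal.
        out.append(dict(tr, reward=sparse_reward(tr["achieved_goal"], tr["goal"])))
        for _ in range(k):
            # "future": substitute a goal that was actually achieved at
            # this step or at a later step of the same episode.
            future_t = rng.randint(t, last)
            new_goal = episode[future_t]["achieved_goal"]
            out.append(dict(tr, goal=new_goal,
                            reward=sparse_reward(tr["achieved_goal"], new_goal)))
    return out
```

Because the substituted goals were really reached later in the episode, some relabeled transitions carry a non-negative reward even when the original episode failed, which gives the agent a learning signal despite the sparse reward.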

Videos

Code

I initially wrote my own implementation of DDPG and HER and added HRL to it. To evaluate it fairly, I decided to use the baseline HER implementation as the foundation for my method and then made changes in various places to implement HRL as well. This allowed me to use their logger, which was very helpful in creating the graphs.

The code I modified is in baselines/baselines/herhrl/

More specifically, I modified replay_buffer.py, rollout.py, her.py, ddpg.py, and actor_critic.py.
