Skip to content

guillaumeboniface/reacher

Repository files navigation

Reacher

This repository contains a collection of agents for environments with continuous action spaces. The environment considered is a robotic arm with 2 joints operating in a 3D space. A moving target rotates around the arm and the goal of the arm is to maintain its end inside the target. At each step the agents action is made of the torque applied to each of its joint along 2 axis. For each step spent within target, the agent receives a reward of 0.1. Each episode lasts 1000 steps. The environment is considered solved if the agent achieve an average reward of 30 over 100 consecutive episodes.

reacher environment

Our implementation contains:

We train 20 agents in parallels and compare the performance of those different implementations. The best performance is obtained with the n-step DDPG, solving the environment in ~25 episodes.

performance graphics

Installing the repo

If you don't have it already, install conda and create a dedicated python environment

conda create --name reacher python=3.6

Activate the environment

conda activate reacher

From the root folder of the repo, install the python requirements

pip install -r requirements.txt

Setup the environment

Download depending on your system:

  • Linux: click here
  • Mac OSX: click here
  • Windows (32-bit): click here
  • Windows (64-bit): click here

Unzip the environment

Running the models

Open jupyter lab (or other python notebook client you favour), open solution_walkthrough.ipynb and point it to the environment file in the second cell. You can now run the full notebook.

Limitations

The repository was developed and tested on Mac OSX. Should you face compatibility issues on other system let us know. Warning: the environment itself can be quite whimsical and stop answering under certain conditions. We found that restarting the notebook kernel solved some of these issues.

About

Reinforcement learning agents for continuous action spaces, including PPO and DDPG, implemented for the Reacher environment

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published