Continuous Control with RL project

Repository for the Udacity RL Specialization second project Continuous Control using Actor-Critic methods.

Project overview

This project has the objective to train an Agent using Actor-Critic methods to solve the Reacher environment.

Enviroment & Task

The Reacher environment is a Unity-based simulation where an agent controls a double-jointed robotic arm to reach target locations. The state space is continuous, with 33 variables representing the arm's position, rotation, velocity, and angular velocities. The action space is also continuous, consisting of 4 variables for torque applied to the arm's joints, with each variable ranging from -1 to 1. The task is episodic, with the agent aiming to maximize its total reward over a fixed number of time steps. The environment is considered solved when the agent achieves an average score of 30 or more over 100 consecutive episodes.

Usage

Installing the environment

To install the env, select the environment that matches your operating system:

The Github Repo already contains the environment for MacOS. If you are using another OS, you must download the environment and place it in the folder rl-robot-movement/.

Training

To train the agent you must open the notebook Continuous_Control_20.ipynb and run all the cells. The agent will be trained and the weights will be saved in the file actor_final.pth and critic_final.pth.

Visualizing trained agent

To visualize the trained agent you must open the notebook Play.ipynb and run all the cells.

Dependencies

The dependencies are listed in the file requirements.txt in the folder python/. To install them you can run the following command:

cd python
pip install .

It is highly recommended to use a virtual environment to install the dependencies. you can do this by running the following commands:

- Linux or Mac:
```bash
conda create --name drlnd python=3.6
source activate drlnd
```
- Windows:
```bash
conda create --name drlnd python=3.6
activate drlnd
```

Files

.
├── Continuous_Control_20.ipynb -> Notebook to train the agent
├── LICENSE ---------------------> License file
├── Play.ipynb ------------------> Notebook to visualize the trained agent
├── README.md -------------------> This file
├── Report.md -------------------> Report of the project
├── actor_critic.py -------------> Actor-Critic model code
├── actor_final.pth -------------> Weights of the trained actor
├── basic_actor_critic_example_3.png -> Example of the Actor-Critic technique
├── critic_final.pth ------------> Weights of the trained critic
├── ddpg.py ---------------------> DDPG agent code
├── play.py ---------------------> Code to visualize the trained agent
├── reacher.gif -----------------> Gif of the environment
├── trained_agent.gif -----------> Gif of the trained agent
└── training_best.png -----------> Plot of the training scores

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Reacher_20.app/Contents		Reacher_20.app/Contents
python		python
.DS_Store		.DS_Store
.gitignore		.gitignore
Continuous_Control_20.ipynb		Continuous_Control_20.ipynb
LICENSE		LICENSE
Play.ipynb		Play.ipynb
README.md		README.md
Report.md		Report.md
actor_critic.py		actor_critic.py
actor_final.pth		actor_final.pth
basic_actor_critic_example_3.png		basic_actor_critic_example_3.png
critic_final.pth		critic_final.pth
ddpg.py		ddpg.py
play.py		play.py
reacher.gif		reacher.gif
trained_agent.gif		trained_agent.gif
training_best.png		training_best.png

License

gabrielcassimiro17/rl-robot-movement

Folders and files

Latest commit

History

Repository files navigation

Continuous Control with RL project

Project overview

Enviroment & Task

Usage

Installing the environment

Training

Visualizing trained agent

Dependencies

Files

References

About

Resources

License

Stars

Watchers

Forks

Languages