Unity-Technologies Continuous Control Project

Unity Machine Learning Agents (ML-Agents) is an open-source Unity plugin that enables games and simulations to serve as environments for training intelligent agents.

For game developers, these trained agents can be used for multiple purposes, including controlling NPC behaviour (in a variety of settings such as multi-agent and adversarial), automated testing of game builds and evaluating different game design decisions pre-release.

In this project, we develop a Deep Deterministic Policy Gradient (DDPG) agent that utilises its newly acquired skills to control a robotic arm, and steer it to a target location. A reward of +0.1 is provided for each step that the agent's hand is in the goal location. Thus, the goal of the agent is to maintain its position at the target location for as many time steps as possible.

The state space consists of 33 variables corresponding to position, rotation, velocity, and angular velocities of the arm. Each action is a vector with four numbers, corresponding to torque applicable to two joints. Every entry in the action vector should be a number between -1 and 1. To solve the environment, the agent must get an average score of +30 over 100 consecutive episodes.

Dependencies

To set up your python environment to run the code in this repository, follow the instructions below.

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drl_cc python=3.6
source activate drl_cc

Windows:

conda create --name drl_cc python=3.6
activate drl_cc

Install Dependencies
- Install Pytoch by following the instructions for your system here
- To install the necessary dependencies run pip install ./python
Download the Unity Environment

For this project, you will not need to install Unity - this is because we have already built the environment for you, and you can download it from one of the links below. You need only select the environment that matches your operating system:

Linux: click here
Mac OSX: click here
Windows (32-bit): click here
Windows (64-bit): click here

Create an IPython kernel for the drl_cc environment.

python -m ipykernel install --user --name drl_cc--display-name "drl_cc"

Before running code in a notebook, change the kernel to match the drl_cc environment by using the drop-down Kernel menu.

Usage

Open the Continuous_Control.ipynb on a notebook and run the cells. In any case, the weights of a pretrained network are saved in actor_checkpoint.pth for the Actor network and critic_checkpoint.pth for the Actor network, so you can witness how a trained agent behaves.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

References

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
python		python
.gitignore		.gitignore
Continuous_Control.ipynb		Continuous_Control.ipynb
README.md		README.md
Report.pdf		Report.pdf
actor_checkpoint.pth		actor_checkpoint.pth
critic_checkpoint.pth		critic_checkpoint.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

python

python

.gitignore

.gitignore

Continuous_Control.ipynb

Continuous_Control.ipynb

README.md

README.md

Report.pdf

Report.pdf

actor_checkpoint.pth

actor_checkpoint.pth

critic_checkpoint.pth

critic_checkpoint.pth

Repository files navigation

Unity-Technologies Continuous Control Project

Dependencies

Usage

Contributing

References

License

About

Releases

Packages

Languages

dpoulopoulos/drl_continuous_control

Folders and files

Latest commit

History

Repository files navigation

Unity-Technologies Continuous Control Project

Dependencies

Usage

Contributing

References

License

About

Resources

Stars

Watchers

Forks

Languages