Solving the DRLND Unity Navigation Environment

Train an agent to solve the Banana Unity Environment from the Deep Reinforcement Learning Nanodegree on Udacity.

After 600 episodes of training, the agent achieved an average score of +13 over 100 consecutive episodes. The trained agent playing in the left GIF above achieved a score of +12 in that instance. The environment is considered solved if the average reward over 100 consecutive episodes is above +13.

The task was solved using a simple DQN architecture with experience replay and using using a separate network that stays fixed for C updates to generate the targets on the 37 dimensional state vector. The DQN learns a parametric approximation of the action-value function that maps the state vector to actions, by minimizing the mean-squared error. More information found in the detailed report.

Training

Prerequisites

conda or miniconda (recommended)
make
Download the environment that matches your OS following the Getting Started from the DRLND repo and unpack it in the root of this project

Install Environment

Automated Install

Simply run make install to install all requirements in a conda environment called drlnd_nav.

Manual Install

Create a conda environment called drlnd_nav with Python3.6 and activate it using the following commands

conda create --name drlnd_nav python=3.6
conda activate drlnd_nav

Then install the requirements file requirements.txt and install the drlnd_nav ipykernel.

pip install -r $(PWD)/requirements.txt
python -m ipykernel install --user --name drlnd_nav --display-name "drlnd_nav"

Run the Code

Next, run make start to start the Jupyter notebook server and use your favorite browser to navigate to http://localhost:8888/?token=abcd.

Train an Agent

To train an agent start the kW_Solution-Train.ipynb notebook and run all the cells (this will also plot the rewards over the number of episodes and save the model weights to disk).

Watch a Trained Agent

To watch a successful agent solve the environment, run all the cells in kW_Solution-Watch.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
figures		figures
gifs		gifs
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
Navigation.ipynb		Navigation.ipynb
Navigation_Pixels.ipynb		Navigation_Pixels.ipynb
README.md		README.md
Report.md		Report.md
checkpoint.pth		checkpoint.pth
dqn_agent.py		dqn_agent.py
kW_Solution-Train.ipynb		kW_Solution-Train.ipynb
kW_Solution-Watch.ipynb		kW_Solution-Watch.ipynb
model.py		model.py
requirements.txt		requirements.txt

License

kw90/drlnd_navigation

Folders and files

Latest commit

History

Repository files navigation

Solving the DRLND Unity Navigation Environment

Training

Prerequisites

Install Environment

Automated Install

Manual Install

Run the Code

Train an Agent

Watch a Trained Agent

About

Resources

License

Stars

Watchers

Forks

Languages