CS 5331 - Pattern Recognition Project

The goal of this project was to train an agent to play Kung Fu Master using reinforcement learning. The report can be found here.

Dependencies

Python 3.5
numpy
CNTK
OpenAI Gym (with Atari package)
Tensorboard (to visualize training progress)

Experiments were run on an Ubuntu 16.04 virtual machine with 7 GB of RAM and 2 cores, hosted on Microsoft Azure. We encountered difficulties when trying to install gym[atari] on a machine running Windows 10.

Training

To see the available arguments, run:

python3 atari_train.py -h

To train using the default options (same as in report), run:

python3 atari_train.py KungFuMaster-v0

The trained model and logs (as well as checkpoints) will be saved to <cur_dir>/chkpt

To visualize the training progress, run:

tensorboard --logdir=chkpt/logs

Although any Atari environment can be specified, the preprocessing step assumes that the current environment is Kung Fu Master. To train on another game, you may have to modify wrapper.py to remove the Kung Fu Master-specific steps.
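For reference, a game-agnostic preprocessing step might look like the sketch below. It is not the code in wrapper.py: the function name `preprocess_frame`, the optional crop bounds, the 2x downsampling and the luminance weights are all assumptions chosen for illustration.

```python
import numpy as np


def preprocess_frame(frame, crop=None, stride=2):
    """Convert an RGB frame of shape (H, W, 3) to a downsampled grayscale image.

    crop: optional (top, bottom, left, right) pixel bounds; None keeps the
    whole frame, which avoids baking in any game-specific layout.
    """
    frame = np.asarray(frame, dtype=np.float32)
    if crop is not None:
        top, bottom, left, right = crop
        frame = frame[top:bottom, left:right]
    # Weighted sum over the colour channels (common luminance weights).
    gray = frame @ np.array([0.299, 0.587, 0.114], dtype=np.float32)
    # Keep every `stride`-th pixel, then scale to roughly [0, 1].
    return gray[::stride, ::stride] / 255.0


# A random array standing in for a 210x160 Atari frame.
frame = np.random.randint(0, 256, size=(210, 160, 3), dtype=np.uint8)
small = preprocess_frame(frame)
print(small.shape)  # (105, 80)
```

Passing `crop` is where game-specific details (for example, cutting off a score bar) would go, so leaving it as None is one way to keep the step generic.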

Evaluation

To see the available arguments, run:

python3 atari_eval.py -h

To obtain a baseline average using a random agent, run:

python3 atari_eval.py path -rnd -r

The -r flag turns on rendering so you can watch the agent play, but it slows evaluation significantly. Use the -e flag to set the number of episodes.
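Conceptually, the random baseline is the mean total reward of an agent that picks actions uniformly at random. The sketch below illustrates that idea and is not the code in atari_eval.py. It assumes the classic Gym interface in which `env.step` returns a 4-tuple, and `CoinFlipEnv` is a hypothetical stand-in so that the example runs without the Atari package.

```python
import random


class CoinFlipEnv:
    """Hypothetical toy environment with a classic Gym-style reset/step interface."""

    class _ActionSpace:
        def sample(self):
            return random.randint(0, 1)

    def __init__(self, length=10):
        self.action_space = self._ActionSpace()
        self.length = length
        self.t = 0

    def reset(self):
        self.t = 0
        return 0  # dummy observation

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0
        done = self.t >= self.length
        return 0, reward, done, {}


def random_baseline(env, episodes):
    """Return the mean episode reward of an agent acting uniformly at random."""
    totals = []
    for _ in range(episodes):
        env.reset()
        done, total = False, 0.0
        while not done:
            _, reward, done, _ = env.step(env.action_space.sample())
            total += reward
        totals.append(total)
    return sum(totals) / len(totals)


print(random_baseline(CoinFlipEnv(), episodes=100))
```

With Gym installed, an environment created by `gym.make('KungFuMaster-v0')` could be passed in place of the toy environment, provided the installed version uses the same 4-tuple `step` interface.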

To evaluate using the final trained model (omit -r to turn off rendering), run:

python3 atari_eval.py chkpt/KungFuMaster-v0.dqn -r

If training was interrupted before completion, you can still evaluate with one of the saved checkpoints:

python3 atari_eval.py chkpt/KungFuMaster-v0_.dqn -c -r

Watch the trained agent here and the random agent here.
