RL-agent

This is an implementation of PGQ: Combining policy gradient and Q-learning Also it contains additional hacks, including:

This agent is implemented using distributed Tensorflow + Redis for synchronising experience replay and weights

Requirements:
-Numpy
-Scipy
-Tensorflow
-Redis (and redis server)
-Joblib
-Gym
-OpenCV (for screen preprocessing)

To run:

python3 run_agent.py

After the run you should kill redis-server process and all worker processes

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback