liampetti / DDPG

DDPG implementation in TensorFlow, modified from the work of Patrick Emami: no TFLearn dependency, an Ornstein-Uhlenbeck noise function, reward discounting, and support for both discrete and continuous action spaces.

README.md
actor.py
critic.py
ddpg.py
noise.py
replay_buffer.py
reward.py

Implementation of DDPG - Deep Deterministic Policy Gradient

Modified from the work of Patrick Emami: Deep Deterministic Policy Gradients in TensorFlow

Algorithm and hyperparameter details are in "Continuous control with deep reinforcement learning" (T. P. Lillicrap, J. J. Hunt, et al., 2015).
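
For orientation, the two update rules at the core of the algorithm in that paper can be sketched in plain NumPy. This is a minimal illustration, not the code in this repository: the function names `td_target` and `soft_update` are hypothetical, the defaults `gamma=0.99` and `tau=0.001` are the values reported in the paper, and masking the bootstrap term on terminal steps is a common convention assumed here.

```python
import numpy as np


def td_target(reward, next_q, done, gamma=0.99):
    """Critic target y = r + gamma * Q'(s', mu'(s')).

    next_q is the target critic's value for the target actor's action.
    Zeroing the bootstrap term on terminal steps is an assumption of this sketch.
    """
    reward = np.asarray(reward, dtype=np.float64)
    next_q = np.asarray(next_q, dtype=np.float64)
    done = np.asarray(done, dtype=bool)
    return reward + gamma * next_q * (~done)


def soft_update(target_weights, online_weights, tau=0.001):
    """Move each target weight a fraction tau toward its online counterpart."""
    return [tau * w + (1.0 - tau) * tw
            for tw, w in zip(target_weights, online_weights)]
```

With a small `tau`, the target networks trail the online networks slowly, which keeps the critic's regression targets from shifting abruptly between updates.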

Tested on the CartPole and Pendulum Gym environments.

Requirements

Gym and TensorFlow.

Modifications

Removed TFLearn dependency
Added Ornstein-Uhlenbeck noise function
Added reward discounting
Works with discrete and continuous action spaces
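
Two of the modifications listed above, Ornstein-Uhlenbeck exploration noise and reward discounting, can be sketched as follows. This is a minimal NumPy sketch under stated assumptions and may differ from what noise.py and reward.py actually do: the class and function names are hypothetical, `theta=0.15` and `sigma=0.2` are the values reported in the DDPG paper, and `dt` and `gamma` are illustrative defaults.

```python
import numpy as np


class OrnsteinUhlenbeckNoise:
    """Temporally correlated noise.

    Discretized update: x += theta * (mu - x) * dt + sigma * sqrt(dt) * N(0, 1)
    """

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        self.mu = mu * np.ones(size)
        self.theta = theta
        self.sigma = sigma
        self.dt = dt
        self.rng = np.random.default_rng(seed)
        self.reset()

    def reset(self):
        # Start each episode at the long-run mean.
        self.x = self.mu.copy()

    def sample(self):
        # The drift pulls the state back toward mu; the diffusion adds Gaussian noise.
        drift = self.theta * (self.mu - self.x) * self.dt
        diffusion = (self.sigma * np.sqrt(self.dt)
                     * self.rng.standard_normal(self.x.shape))
        self.x = self.x + drift + diffusion
        return self.x


def discount_rewards(rewards, gamma=0.99):
    """Discounted return G_t = r_t + gamma * G_{t+1} for every step of an episode."""
    returns = np.zeros(len(rewards), dtype=np.float64)
    running = 0.0
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns
```

In use, a sample from the noise process would be added to the actor's output at each step, with `reset()` called at the start of every episode. Because consecutive samples are correlated, the exploration drifts smoothly rather than jittering independently from step to step.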
