Skip to content

Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function, reward discounting, works on discrete & continuous action spaces

liampetti/DDPG

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementation of DDPG - Deep Deterministic Policy Gradient

Modified from the work of Patrick Emami: Deep Deterministic Policy Gradients in TensorFlow

Algorithm and hyperparameter details can be found here: "Continuous control with deep reinforcement learning" - TP Lillicrap, JJ Hunt et al., 2015

Tested on CartPole & Pendulum

Requirements

Gym and TensorFlow.

Modifications

  • Removed TFLearn dependency
  • Added Ornstein Uhlenbeck noise function
  • Added reward discounting
  • Works with discrete and continuous action spaces

About

Implementation of DDPG (Modified from the work of Patrick Emami) - Tensorflow (no TFLearn dependency), Ornstein Uhlenbeck noise function, reward discounting, works on discrete & continuous action spaces

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages