Skip to content

rk1998/robot-sac

Repository files navigation

robot-sac

Deep Deterministic Policy Gradients

  • DDPG Paper:
  • The implementation of this algorithm can be found in ddpg.swift. This script contains code for the Actor and Critic networks and also includes the training setup for the DDPG algorithm.
  • To run this script simply run "swift ddpg.swift" This script will train a DDPG agent on the inverted pendulum problem from gym
  • I also wrote another script to train the DDPG agent on the Bipedal Walker environment. That can be found in ddpg_walker.swift
  • I also made a notebook on Google Colab with this same code:Link to Notebook

Soft Actor Critic

  • Soft Actor Critic Paper:
  • The implementation for this algorithm can be found in sac.swift. This script contains code for the Gaussian Actor as well as implementations for the Q(s, a) network and the V(s) network. The training setup can also be found in this script.
  • I based my implementation of of a python implementation of the algorithm found in this repo: https://github.com/keiohta/tf2rl
  • To run this script simply run "swift sac.swift" The script will train the SAC agent on the inverted pendulum problem from gym
  • You can also run this code on a Google Colab notebook Link to Notebook:

DDPG + Hindsight Experience Replay

  • Hindsight Experience Replay Paper:
  • The Hindsight Experience Replay Algorithm deals with Sparse Reward environments and has demonstrated great performance on complex robotic control tasks when combined with DDPG.
  • The implementation for this algorithm can be found in ddpg_her.swift. To run the script simply run "swift ddpg_her.swift" from the command line. The script will train the DDPG + HER agent on the Fetch Push problem from gym

About

Repo for CS 8903 Special Problems Course at Georgia Tech

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published