robot-sac

Repo for CS 8903 Special Problems Course at Georgia Tech.
Implementation of the Soft Actor-Critic algorithm by Haarnoja et al., Deep Deterministic Policy Gradients by Lillicrap et al., and Hindsight Experience Replay by Andrychowicz et al.
Implemented using Swift for TensorFlow and tested on OpenAI Gym environments.
Swift for TensorFlow Repo
Final Presentation for CS 8903 Class

Deep Deterministic Policy Gradients

DDPG Paper:
The implementation of this algorithm can be found in ddpg.swift. This script contains code for the Actor and Critic networks and also includes the training setup for the DDPG algorithm.
To run this script, run "swift ddpg.swift". It trains a DDPG agent on the inverted pendulum problem from Gym.
A second script, ddpg_walker.swift, trains the DDPG agent on the Bipedal Walker environment.
I also made a Google Colab notebook with this same code: Link to Notebook
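
The two update rules at the core of DDPG training can be sketched independently of any framework. The following is a minimal illustration in plain Python, not the code from ddpg.swift: the function names and the default values for gamma and tau are assumptions chosen for the example.

```python
def td_target(reward, done, next_q, gamma=0.99):
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` is the target critic's value for the target actor's action
    in the next state; `done` is 1.0 for terminal transitions, else 0.0.
    """
    return reward + gamma * (1.0 - done) * next_q


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging of target-network parameters.

    Computes theta_target <- tau * theta_online + (1 - tau) * theta_target
    element by element and returns the new target parameters.
    """
    return [
        tau * online + (1.0 - tau) * target
        for target, online in zip(target_params, online_params)
    ]
```

The critic is regressed toward td_target, the actor is updated to increase the critic's value of its own actions, and soft_update is applied to both target networks after each gradient step.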

Soft Actor Critic

Soft Actor Critic Paper:
The implementation for this algorithm can be found in sac.swift. This script contains code for the Gaussian Actor as well as implementations for the Q(s, a) network and the V(s) network. The training setup can also be found in this script.
I based my implementation on a Python implementation of the algorithm found in this repo: https://github.com/keiohta/tf2rl
To run this script, run "swift sac.swift". It trains the SAC agent on the inverted pendulum problem from Gym.
You can also run this code in a Google Colab notebook: Link to Notebook
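
How the Gaussian actor, the Q(s, a) network, and the V(s) network fit together can be sketched as follows. This is a framework-free illustration in plain Python rather than the code from sac.swift. It assumes a tanh-squashed Gaussian policy as described in the SAC paper, and the function names and the default temperature alpha are hypothetical.

```python
import math
import random


def sample_squashed_gaussian(mean, log_std, rng=random):
    """Sample a = tanh(u) with u ~ N(mean, std^2); return (a, log pi(a|s))."""
    actions, log_prob = [], 0.0
    for m, ls in zip(mean, log_std):
        std = math.exp(ls)
        u = m + std * rng.gauss(0.0, 1.0)
        a = math.tanh(u)
        # Gaussian log-density of the pre-squash sample u.
        log_prob += -0.5 * ((u - m) / std) ** 2 - ls - 0.5 * math.log(2.0 * math.pi)
        # Change-of-variables correction for the tanh squashing.
        log_prob -= math.log(1.0 - a * a + 1e-6)
        actions.append(a)
    return actions, log_prob


def v_target(q_value, log_prob, alpha=0.2):
    """Soft value target: V(s) is regressed toward Q(s, a) - alpha * log pi(a|s)."""
    return q_value - alpha * log_prob


def q_target(reward, done, next_v, gamma=0.99):
    """Soft Q target: Q(s, a) is regressed toward r + gamma * (1 - done) * V_target(s')."""
    return reward + gamma * (1.0 - done) * next_v
```

The entropy term enters through v_target: actions with low log-probability raise the value target, which is what pushes the policy toward exploration.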

DDPG + Hindsight Experience Replay

Hindsight Experience Replay Paper:
The Hindsight Experience Replay algorithm targets sparse-reward environments: it replays each stored transition with goals the agent actually reached, so that failed episodes still produce a learning signal. Combined with DDPG, it has demonstrated strong performance on complex robotic control tasks.
The implementation for this algorithm can be found in ddpg_her.swift. To run the script, run "swift ddpg_her.swift" from the command line. It trains the DDPG + HER agent on the Fetch Push problem from Gym.
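
The goal relabeling step can be sketched in a few lines. The example below is a plain-Python illustration, not the code from ddpg_her.swift. It assumes the "future" relabeling strategy from the HER paper and a Fetch-style sparse reward (0 within a distance threshold, -1 otherwise). The dictionary keys, the function names, and the values of k and threshold are hypothetical.

```python
import math
import random


def sparse_reward(achieved_goal, goal, threshold=0.05):
    """Return 0.0 if the achieved goal is within `threshold` of `goal`, else -1.0."""
    return 0.0 if math.dist(achieved_goal, goal) <= threshold else -1.0


def her_relabel(episode, k=4, rng=random):
    """Return the original transitions plus k relabeled copies of each.

    `episode` is a list of dicts with keys "obs", "action", "next_obs",
    "achieved_goal" (the goal reached in next_obs), and "goal".
    """
    transitions = []
    for t, step in enumerate(episode):
        # Keep the original transition with its original goal.
        reward = sparse_reward(step["achieved_goal"], step["goal"])
        transitions.append({**step, "reward": reward})
        for _ in range(k):
            # "future" strategy: substitute a goal achieved at step t or later.
            future = rng.randrange(t, len(episode))
            new_goal = episode[future]["achieved_goal"]
            reward = sparse_reward(step["achieved_goal"], new_goal)
            transitions.append({**step, "goal": new_goal, "reward": reward})
    return transitions
```

All returned transitions go into the replay buffer, and DDPG then trains on them with the goal concatenated to the observation.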
