GitHub - akashe/DeepReinforcementLearning: Deep RL implementations. DQN, SAC, DDPG, TD3, PPO and VPG implemented in pytorch. Tested Env: LunarLander-v2 and Pendulum-v0.

Deep RL algorithms implemented using Pytorch

Algo list:

Article on deeper Look into policy gradients

Experimental Results:

Algorithm	Discrete Env: LunarLander-v2	Continuous Env: Pendulum-v0
DQN		-
VPG		-
DDPG	-
TD3	-
SAC	-
PPO	-

Usage:

Just run the file/algorithm directly. There is no common structures between algorithms as I implemented them as I learnt them. Different algorithms are inspired from different sources.

Resources:

Future projects:

If time available I will add a simple program for elevator using RL.
Better graphs

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
RLUtils		RLUtils
agents		agents
figures		figures
DQN.py		DQN.py
Policy Gradient Methods.ipynb		Policy Gradient Methods.ipynb
Readme.md		Readme.md
SoftActorCritic.py		SoftActorCritic.py
ddpg.py		ddpg.py
ppo_clip.py		ppo_clip.py
td3.py		td3.py
vanilla_policy_gradient.py		vanilla_policy_gradient.py

akashe/DeepReinforcementLearning

Folders and files

Latest commit

History

Repository files navigation

Deep RL algorithms implemented using Pytorch

Algo list:

Article on deeper Look into policy gradients

Experimental Results:

Usage:

Resources:

Future projects:

About

Topics

Resources

Stars

Watchers

Forks

Languages