simplementation

Clean implementations of papers I read for robotics research, plus handwritten notes on some of them. The aim is well-commented code that reads easily. The list is loosely organized after the Spinning Up key papers list: https://spinningup.openai.com/en/latest/spinningup/keypapers.html

1. Human-level control through deep reinforcement learning, Mnih et al, Nature 2015.

Gist : Use deep reinforcement learning with a Deep Q-Network (DQN) to reach human-level or better performance on Atari games
Paper : https://www.nature.com/articles/nature14236.pdf
Algorithm/Techniques : DQN, Experience Replay
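
To make the two techniques above concrete, here is a minimal sketch in plain Python. It is not the code in this repo: `ReplayBuffer`, `dqn_target`, and the `gamma` default are hypothetical names and values chosen for illustration, and Q-values are plain lists instead of network outputs.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # deque with maxlen evicts the oldest transition once full
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement decorrelates consecutive steps
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)


def dqn_target(reward, next_q_values, done, gamma=0.99):
    """One-step TD target: r + gamma * max_a' Q_target(s', a'), or r if terminal.

    next_q_values is assumed to come from the (frozen) target network.
    """
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a training loop, transitions would be pushed after every environment step and a batch sampled once the buffer holds enough of them; the network is then regressed toward `dqn_target` for each sampled transition.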

2. Deep Reinforcement Learning with Double Q-learning, van Hasselt et al, 2015.

Gist : Illustrate vanilla DQN's tendency to overestimate Q-values, and propose 'Double DQN', which uses two separate neural networks: one to select the action and one to evaluate its Q-value.
Paper : https://arxiv.org/pdf/1509.06461.pdf
Algorithm/Techniques : Double-DQN, Experience Replay
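
The split between selecting and evaluating can be sketched as a single target computation. This is an illustrative sketch, not this repo's implementation: `double_dqn_target` is a hypothetical name, and the two Q-value lists are assumed to come from the online and target networks for the same next state.

```python
def double_dqn_target(reward, next_q_online, next_q_target, done, gamma=0.99):
    """Double DQN target: the online network picks the action,
    the target network supplies that action's value."""
    if done:
        return reward
    # Action selection uses the online network's estimates
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    # Action evaluation uses the target network's estimate
    return reward + gamma * next_q_target[best_action]
```

With `next_q_online = [1.0, 3.0]` and `next_q_target = [5.0, 2.0]`, the selected action is index 1, so the bootstrap value is 2.0 and not the target network's own maximum of 5.0. That decoupling is what dampens the overestimation.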

3. Dueling Network Architectures for Deep Reinforcement Learning, Wang et al, 2016.

Gist : Propose the "Dueling Network Architecture", which computes separate estimates of the state-value function and the advantage function and combines them to evaluate the Q-value.
Paper : https://arxiv.org/pdf/1511.06581.pdf
Algorithm/Techniques : Dueling Q-Network, Experience Replay
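
How the two streams are recombined can be shown with a small sketch. It assumes the mean-subtracted aggregation Q(s, a) = V(s) + A(s, a) - mean(A(s, ·)); `dueling_q_values` is a hypothetical helper that works on plain numbers, whereas a real network would apply the same formula to the outputs of its value and advantage heads.

```python
def dueling_q_values(state_value, advantages):
    """Combine a scalar state value with per-action advantages.

    Subtracting the mean advantage keeps the value and advantage
    streams identifiable: adding a constant to every advantage
    leaves the resulting Q-values unchanged.
    """
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + (a - mean_advantage) for a in advantages]
```

For example, `dueling_q_values(2.0, [1.0, 3.0])` gives `[1.0, 3.0]`, and shifting both advantages by the same amount gives the same result.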

4. Prioritized Experience Replay, Schaul et al, ICLR 2016.

Gist :
Paper : Prioritized Experience Replay, Schaul et al, ICLR 2016.
Algorithm/Techniques : Prioritized Experience Replay, Q-Learning
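
As a rough sketch of the proportional variant of prioritized replay: transition i is sampled with probability P(i) = p_i^alpha / sum_k p_k^alpha, and the resulting bias is corrected with importance-sampling weights w_i = (N * P(i))^(-beta), normalized by the largest weight. The function names and the `alpha` and `beta` defaults below are illustrative assumptions, and priorities are assumed to be positive (for example |TD error| plus a small constant).

```python
def sampling_probabilities(priorities, alpha=0.6):
    """P(i) = p_i^alpha / sum_k p_k^alpha; alpha=0 recovers uniform sampling."""
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    return [s / total for s in scaled]


def importance_weights(probs, beta=0.4):
    """w_i = (N * P(i))^(-beta), divided by the largest weight so w_i <= 1."""
    n = len(probs)
    weights = [(n * p) ** (-beta) for p in probs]
    max_weight = max(weights)
    return [w / max_weight for w in weights]
```

A batch would then be drawn with something like `random.choices(range(n), weights=probs, k=batch_size)`, and each sampled transition's loss multiplied by its weight before the gradient step.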
