Skip to content

jiseongHAN/reinforcement

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementaion RL Algorithm with Pytorch

Policy Based

1. Reinforce : Done

2-1 PPO with Continuous action space :Done

2-2 PPO with Atari Environment : Done

3. Deep Deterministic Policy Gradient : Pendulum Env Done

Value Based

1. Deep Q-Learning : Done

2. Dobule DQN : Done

3. Dueling DQN : Done

4. C51 : need fix

Sampling Method

Sparse Reward Env