Skip to content

volcinschi/Reinforcement-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 

Repository files navigation

Reinforcement-Learning

Reinforcement learning homework IBIO4615

Dependencies

  • Python3.5+
  • Pytorch 1.0.1.
  • TensorFlow 1.2
  • gym, matplotlib, numpy, tensorboardx
pip install gym
pip install tensorboardx 
pip install tensorflow=1.2

DQN

Tasks:

  1. Play with the hyperparameters and show their corresponding graphs. Which parameter caused the most change? Which one didn’t affect that much? Discuss briefly your results
  2. Anneal the 𝞮 hyperparameter to decay linearly instead of being fixed? Did it help at all? Why?
  3. Try two different architectures and report any results

DDPG

Tasks:

  1. Change DDPG to Mountain car (May tune a bit the hyperparameters as constant time systems are different). Compare with DQN (# of episodes till convergence)
  2. (Optional) As you see reward/cost penalize control law/actions change it so it penalize more control energy used and plot u(t) for different initial positions of the pendulum.

Note that DDPG is feasible about hyper-parameters. You should fine-tuning if you change to another environment.

Episode reward in Pendulum-v0:

ep_r

About

Reinforcement learning homework IBIO4615

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages