antonio-f / Dynamic-Programming Star 9 Code Issues Pull requests Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncated Policy Iteration, Value Iteration . From Udacity's Deep Reinforcement Learning Nanodegree program. reinforcement-learning openai-gym gym dynamic-programming policy-evaluation policy-iteration value-iteration bellman-equation frozenlake policy-improvement state-value-function action-value-function Updated Apr 3, 2019 Jupyter Notebook
TanushGoel / Atari-Games-RL Star 2 Code Issues Pull requests a collection of python notebooks using RL agents to play Atari games in OpenAI gym environments reinforcement-learning monte-carlo q-learning policy-gradient atari-games actor-critic-methods temporal-difference state-value-function policy-based-method Updated Jun 4, 2020 Jupyter Notebook