OpenAI-Gym-Solutions

Solutions to OpenAI-Gym environments using various machine learning algorithms.

Strategies

Evolutionary Learning Strategy: Start with some initial weights and generate weights for each member in the population (by adding noise to the current weight). Track performance of each member's performance and update current weight (by a weighted sum). Learn more about Evolutionary Learning Strategies here.

REINFORCE: Monte Carlo Policy Gradient Perform gradient ascent after every episode on the weighted sum at time t, the probability of taking the particular action multiplied by the expected total discounted reward following the current policy. A discounted reward is used to enforce finishing early. Learn more about REINFORCE: Monte Carlo Policy Gradient by reading Reinforcement Learning: An Introduction (chapter 13.3) by Sutton & Barto here. (Book is unfinished and chapters order may change)

Evaluations

LunarLander-v2

ELS 1 2 3

CartPole-v0

ELS 1

REINFORCE-MCMC 1 2

CartPole-v1

ELS 1

REINFORCE-MCMC 1 2

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
CartPole		CartPole
LunarLander		LunarLander
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CartPole

CartPole

LunarLander

LunarLander

LICENSE

LICENSE

README.md

README.md

Repository files navigation

OpenAI-Gym-Solutions

Strategies

Evaluations

About

Releases

Packages

Languages

License

Adriel-M/OpenAI-Gym-Solutions

Folders and files

Latest commit

History

Repository files navigation

OpenAI-Gym-Solutions

Strategies

Evaluations

About

Topics

Resources

License

Stars

Watchers

Forks

Languages