OpenAI_gym_Taxi-v2

OpenAI_gym_Taxi-v2 solved with reinforcement learning - Expected Sarsa

This task was introduced in [Dietterich2000] to illustrate some issues in hierarchical reinforcement learning. There are 4 locations (labeled by different letters) and your job is to pick up the passenger at one location and drop him off in another. You receive +20 points for a successful dropoff, and lose 1 point for every timestep it takes. There is also a 10 point penalty for illegal pick-up and drop-off actions. [Dietterich2000] T Erez, Y Tassa, E Todorov, "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition", 2011. source: https://gym.openai.com/envs/Taxi-v2/

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb		OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb

OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb

README.md

README.md

Repository files navigation

OpenAI_gym_Taxi-v2

About

Releases

Packages

Languages

MikeHatchi/OpenAI_gym_Taxi-v2

Folders and files

Latest commit

History

OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb

OpenAI Gym - Taxi-V2- Reinforcement Learning Expected SARSA - simple.ipynb

README.md

README.md

Repository files navigation

OpenAI_gym_Taxi-v2

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages