Gym-Taxi-v3

My solution to the Gym environment Taxi-v3 using the Q learning algorithm.

The code is written to be executed in an IPython console.

Training

Once the code is executed the model can be trained for a number of training_episodes by:

agent.train(train_episodes)

The model trains until all episodes have passed.

Testing

Once the model is trained, It can be tested for a number of test_episodes by:

agent.test(test_episodes)

Technical Information

The environment is solved using a Q Learning implementation. The model performs random actions decreasingly often as a means of exploration.

After 100000 episodes the model is certainly done training and subsequent test results are:

Average amount of steps: 13.07659
Average amount of penalties: 0.0

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
Taxi.py		Taxi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Taxi.py

Taxi.py

Repository files navigation

Gym-Taxi-v3

Training

Testing

Technical Information

About

Releases

Packages

Languages

tsfloss/Gym-Taxi-v3

Folders and files

Latest commit

History

README.md

README.md

Taxi.py

Taxi.py

Repository files navigation

Gym-Taxi-v3

Training

Testing

Technical Information

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages