Brief Introduction

In our project, we are tasked with learning an agent to traverse a frozen lake without falling into the water. The agent learns by trial-and-error, adjusting the actions it takes based on the rewards it received in the past.
We will use the Q-learning algorithm. This algorithm generates a table called the Q-table which has a mapping of every state and possible action to a value. The agent will learn which actions to take based on the values of this table.

How does the behavior of the agent differ when using a high or low value for the exploration-exploitation (ε) parameter
Does the discount factor (γ) have a noticeable impact on the score achieved by the agent
Does the learning rate (α) have a noticeable impact on the score achieved by the agent

Full report

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
(old) KTAI_Crawler_with_simple_Q-Learning.ipynb		(old) KTAI_Crawler_with_simple_Q-Learning.ipynb
README.md		README.md
q_learning_gymnasium.ipynb		q_learning_gymnasium.ipynb
q_learning_report.pdf		q_learning_report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(old) KTAI_Crawler_with_simple_Q-Learning.ipynb

(old) KTAI_Crawler_with_simple_Q-Learning.ipynb

README.md

README.md

q_learning_gymnasium.ipynb

q_learning_gymnasium.ipynb

q_learning_report.pdf

q_learning_report.pdf

Repository files navigation

Brief Introduction

About

Releases

Packages

Contributors 2

Languages

4rn3/KTA_q_learning

Folders and files

Latest commit

History

Repository files navigation

Brief Introduction

About

Topics

Resources

Stars

Watchers

Forks

Languages