# Multi-Armed-Bandit

## Description

This is an implementation of the $\epsilon$-Greedy, Greedy, and Upper Confidence Bound (UCB) algorithms for the Multi-Armed Bandit problem. Details of these algorithms can be found in Chapter 2 of *Reinforcement Learning: An Introduction* by Richard S. Sutton and Andrew G. Barto.
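The repository's own code is the authoritative implementation; as a rough sketch of the three action-selection rules on a Gaussian 10-armed testbed (an assumption, not necessarily the repo's exact setup):

```python
import numpy as np

rng = np.random.default_rng(0)

def run_bandit(n_arms=10, steps=1000, epsilon=0.0, q1=0.0, c=None):
    """One bandit episode with sample-average value estimates.

    epsilon > 0   -> epsilon-greedy exploration
    q1 > 0        -> optimistic initial values (greedy variant)
    c is not None -> UCB action selection with exploration constant c
    """
    true_means = rng.normal(0.0, 1.0, n_arms)  # hidden arm means
    q = np.full(n_arms, q1, dtype=float)       # value estimates Q(a)
    counts = np.zeros(n_arms)                  # pulls per arm N(a)
    rewards = np.empty(steps)
    for t in range(steps):
        if c is not None:
            # UCB: add an exploration bonus; unpulled arms get priority
            with np.errstate(divide="ignore", invalid="ignore"):
                bonus = c * np.sqrt(np.log(t + 1) / counts)
            a = int(np.argmax(np.where(counts == 0, np.inf, q + bonus)))
        elif rng.random() < epsilon:
            a = int(rng.integers(n_arms))      # explore uniformly at random
        else:
            a = int(np.argmax(q))              # exploit current estimate
        r = rng.normal(true_means[a], 1.0)     # noisy reward from chosen arm
        counts[a] += 1
        q[a] += (r - q[a]) / counts[a]         # incremental sample average
        rewards[t] = r
    return rewards
```

For example, `run_bandit(epsilon=0.1)`, `run_bandit(q1=5.0)`, and `run_bandit(c=2.0)` correspond to the three configurations compared in the tasks below.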

## How to Install

```bash
# In project root folder
pip install -r requirements.txt
```

## How to Run

```bash
# In project root folder
./run.sh
```

## Tasks

### Part 1

A plot of reward over time (averaged over 100 runs), on the same axes, for $\epsilon$-greedy with $\epsilon = 0.1$, greedy with optimistic initial values $Q_1 = 5$, and UCB with $c = 2$.

### Part 2

A summary comparison plot of rewards over the first 1000 steps for the three algorithms under different hyperparameter values.
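The averaging procedure behind these plots can be sketched as follows; this is a hypothetical harness using only $\epsilon$-greedy for brevity (the repository's `run.sh` drives the actual experiments):

```python
import numpy as np

rng = np.random.default_rng(1)

def one_run(steps=1000, epsilon=0.1, n_arms=10):
    # Minimal epsilon-greedy episode on a Gaussian 10-armed testbed.
    means = rng.normal(size=n_arms)   # hidden arm means, redrawn each run
    q = np.zeros(n_arms)              # value estimates
    n = np.zeros(n_arms)              # pull counts
    out = np.empty(steps)
    for t in range(steps):
        if rng.random() < epsilon:
            a = int(rng.integers(n_arms))
        else:
            a = int(np.argmax(q))
        r = rng.normal(means[a], 1.0)
        n[a] += 1
        q[a] += (r - q[a]) / n[a]     # incremental sample-average update
        out[t] = r
    return out

# Average the per-step reward curve over 100 independent runs,
# as in the plots described above.
avg_reward = np.mean([one_run() for _ in range(100)], axis=0)
```

The resulting `avg_reward` array is what would be passed to `matplotlib.pyplot.plot` to produce one curve; repeating this for each algorithm/hyperparameter setting yields the comparison plot.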
