Multi-armed Bandits: 10-armed Testbed

This repository implements the 10-armed testbed described in Chapter 2 of the textbook Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto (2nd edition). It contains the testbed, all the algorithms from the chapter, and the chapter's two programming exercises. I have used Python's multiprocessing module so that the algorithms can be run in parallel.

I have tested the code with Python 3, though it could also work with Python 2 with some minor tweaks.
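
To make the idea concrete, here is a minimal sketch of a 10-armed testbed run in parallel. It is not the code from this repository: the function names `run_episode` and `average_reward`, their parameters, and the choice of an epsilon-greedy agent with sample-average estimates are assumptions made for illustration. It assumes NumPy is installed and follows the textbook setup, where true action values are drawn from a unit normal distribution and rewards are drawn from a unit-variance normal around the chosen arm's true value.

```python
import multiprocessing as mp

import numpy as np


def run_episode(args):
    """Run one epsilon-greedy agent on a freshly sampled bandit (illustrative)."""
    seed, n_steps, epsilon, n_arms = args
    rng = np.random.default_rng(seed)
    q_true = rng.normal(0.0, 1.0, n_arms)  # true action values q*(a)
    q_est = np.zeros(n_arms)               # sample-average estimates Q(a)
    counts = np.zeros(n_arms, dtype=int)   # how often each arm was pulled
    rewards = np.empty(n_steps)
    for t in range(n_steps):
        if rng.random() < epsilon:
            # explore: pick an arm uniformly at random
            action = int(rng.integers(n_arms))
        else:
            # exploit: pick a greedy arm, breaking ties at random
            best = np.flatnonzero(q_est == q_est.max())
            action = int(rng.choice(best))
        reward = rng.normal(q_true[action], 1.0)
        counts[action] += 1
        # incremental sample-average update
        q_est[action] += (reward - q_est[action]) / counts[action]
        rewards[t] = reward
    return rewards


def average_reward(n_runs=200, n_steps=1000, epsilon=0.1, n_arms=10, processes=None):
    """Average the per-step reward over independent runs, one job per run."""
    jobs = [(seed, n_steps, epsilon, n_arms) for seed in range(n_runs)]
    with mp.Pool(processes) as pool:
        results = pool.map(run_episode, jobs)
    return np.mean(results, axis=0)


if __name__ == "__main__":
    curve = average_reward(n_runs=50, n_steps=500)
    print(f"mean reward over the last 100 steps: {curve[-100:].mean():.3f}")
```

Each run gets its own seed and its own bandit, so the runs are independent and can be handed to separate worker processes without any shared state. The `if __name__ == "__main__":` guard matters on platforms where multiprocessing starts workers by re-importing the main module.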

Folders and files:

data/exercise_plots
tests
.gitignore
LICENSE
README.md
RL Chapter 2 Exercises.ipynb
algorithms.py
bandit.py
config.yml
plotting.py
run.py
utils.py
