A Python library for popular multi-armed bandit algorithms.
The following algorithms are currently being explored before we set a roadmap for the first release.
- Epsilon Bandit
  - Epsilon-greedy strategy (done)
  - Epsilon-first strategy (in progress)
  - Epsilon-decreasing strategy
  - Epsilon-adaptive strategy
- Bayesian Bandit
  - Thompson Sampling
- Contextual Bandit
  - Linear Classifier
    - LinUCB (Linear Upper Confidence Bound) algorithm
    - LinRel (Linear Associative Reinforcement Learning) algorithm
  - Non-linear Classifier
    - UCBogram algorithm
    - NeuralBandit algorithm
    - KernelUCB algorithm
    - Bandit Forest algorithm
  - Constrained
    - UCB-ALP algorithm
  - Greedy
    - Contextual-Epsilon-greedy strategy
    - Linear Classifier
- Adversarial Bandit
- Dueling Bandit
- Collaborative Bandit
  - COFIBA algorithm
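To illustrate the first completed item, the epsilon-greedy strategy can be sketched in a few lines: with probability epsilon the agent explores a random arm, otherwise it exploits the arm with the highest estimated mean reward. The function and variable names below are illustrative only, not the library's API.

```python
import random


def epsilon_greedy(values, epsilon=0.1):
    """Choose an arm: explore uniformly with probability epsilon,
    otherwise exploit the arm with the highest estimated mean reward."""
    if random.random() < epsilon:
        return random.randrange(len(values))
    return max(range(len(values)), key=lambda i: values[i])


def update(counts, values, arm, reward):
    """Incrementally update the running mean reward of the chosen arm."""
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]
```

A simulation loop then alternates `epsilon_greedy` to pick an arm and `update` to fold in the observed reward; over time the estimates concentrate on the best arm while epsilon keeps a floor of exploration.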
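Thompson Sampling, the Bayesian Bandit entry above, admits an equally compact sketch for Bernoulli rewards: keep a Beta posterior per arm, sample once from each, and play the arm with the largest draw. Again, the names here are a hypothetical sketch, not the library's interface.

```python
import random


def thompson_select(alpha, beta):
    """Draw one sample from each arm's Beta(alpha, beta) posterior
    and choose the arm with the largest sampled value."""
    samples = [random.betavariate(a, b) for a, b in zip(alpha, beta)]
    return max(range(len(samples)), key=lambda i: samples[i])


def thompson_update(alpha, beta, arm, reward):
    """A Bernoulli reward is a conjugate update: success bumps alpha,
    failure bumps beta."""
    if reward:
        alpha[arm] += 1
    else:
        beta[arm] += 1
```

Because each arm's draw reflects posterior uncertainty, poorly-explored arms still win occasionally, so exploration decays naturally as the posteriors sharpen, with no epsilon to tune.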