bandits

Implementation of the Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of reinforcement learning problems in which an agent learns to choose among a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are two popular algorithms for solving MAB problems.
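To make the description above concrete, here is a minimal sketch of the two named algorithms on a simulated Bernoulli bandit. It is not taken from the repository: the `BernoulliBandit` environment, the function names, and all parameter values are assumptions chosen for illustration, and the UCB variant shown is the textbook UCB1 index.

```python
import math
import random


class BernoulliBandit:
    """Hypothetical test environment: arm i pays 1 with probability probs[i]."""

    def __init__(self, probs, seed=0):
        self.probs = list(probs)
        self.rng = random.Random(seed)

    def pull(self, arm):
        return 1.0 if self.rng.random() < self.probs[arm] else 0.0


def epsilon_greedy(bandit, n_rounds, epsilon=0.1, seed=0):
    """With probability epsilon explore a random arm, otherwise exploit."""
    rng = random.Random(seed)
    k = len(bandit.probs)
    counts = [0] * k      # number of pulls per arm
    means = [0.0] * k     # running average reward per arm
    for _ in range(n_rounds):
        if rng.random() < epsilon:
            arm = rng.randrange(k)
        else:
            arm = max(range(k), key=lambda a: means[a])
        reward = bandit.pull(arm)
        counts[arm] += 1
        # Incremental update of the sample mean.
        means[arm] += (reward - means[arm]) / counts[arm]
    return counts, means


def ucb1(bandit, n_rounds):
    """Play each arm once, then pick the arm with the highest UCB1 index."""
    k = len(bandit.probs)
    counts = [0] * k
    means = [0.0] * k
    for t in range(1, n_rounds + 1):
        if t <= k:
            arm = t - 1  # initialisation: one pull per arm
        else:
            arm = max(
                range(k),
                key=lambda a: means[a] + math.sqrt(2 * math.log(t) / counts[a]),
            )
        reward = bandit.pull(arm)
        counts[arm] += 1
        means[arm] += (reward - means[arm]) / counts[arm]
    return counts, means


if __name__ == "__main__":
    probs = [0.2, 0.5, 0.8]  # assumed arm success probabilities
    print("epsilon-greedy pulls:", epsilon_greedy(BernoulliBandit(probs), 5000)[0])
    print("UCB1 pulls:", ucb1(BernoulliBandit(probs), 5000)[0])
```

Both functions return the per-arm pull counts and estimated mean rewards. In this sketch Epsilon-Greedy keeps exploring at a fixed rate, while UCB1 shrinks its exploration bonus for an arm as that arm's pull count grows.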

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python

sarthakmittal92 / multi-armed-bandits

Repository for the course project for CS-747 (Foundations of Intelligent & Learning Agents) at IIT Bombay, Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

doerlbh / dilemmaRL

Code for our PRICAI 2022 paper: "Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior".

machine-learning reinforcement-learning game-theory multiplayer-game behavioral-cloning multiagent-systems human-behavior bandits contextual-bandits prisoner-dilemma

Updated Aug 27, 2022
Python

philinemey / BSE-T3-RL

Coursework, Stochastic Models and Optimization, BSE, Term 3, Class of 2022

reinforcement-learning dynamic-programming gaussian-processes bayesian-optimization policy-iteration bandits

Updated Jul 6, 2022
Jupyter Notebook

kfoofw / applied_learning_articles

Collaborative project for documenting ML/DS learnings.

causal-inference bandits uplift-modelling

Updated May 5, 2022
Jupyter Notebook

Nicolivain / trustful-bandits

A two-armed bandit simulation and comparison with theoretical convergence

reinforcement-learning asset-allocation bandits stochastic-algorithm stochastic-optimization stochastic-algorithms bandit-algorithms trading-agent online-optimization

Updated Apr 16, 2022
Jupyter Notebook

Nicolivain / RLD

Deep Reinforcement Learning Agents in Pytorch in a modular framework

reinforcement-learning deep-reinforcement-learning pytorch bandits gym-environment

Updated Mar 16, 2022
Jupyter Notebook

doerlbh / BanditZoo

Python library of bandits and RL agents in different real-world environments

reinforcement-learning simulation bandits bandit bandit-algorithms

Updated Feb 21, 2022
Python

riccardodv / COOP-learning

Implementation of the experiments for "Cooperative Online Learning with Feedback Graphs" by Cesa-Bianchi, Cesari, and Della Vecchia (https://arxiv.org/abs/2106.04982)

bandits cooperation online-learning-algorithms

Updated Feb 16, 2022
Python

babaniyi / Deep-contextual-bandits

A benchmark to test decision-making algorithms for contextual bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and synthetic data problems exhibiting a diverse set of properties.

bandits bandit-algorithms multiarmed-bandits

Updated Jan 26, 2022
Python

MehranTaghian / prophet-inequlity-implementation

Implementation of the prophet inequalities

multi-armed-bandits bandits prophet-inequality k-prophet

Updated Dec 11, 2021
Python

bandits

Here are 40 public repositories matching this topic...

iheartradio / thomas

tensorflow / agents

pappar-delle / AI-Labs-2022-23

DURUII / Replica-AUCB

thoughtworks / simplebandit

ElianBelot / bernoulli-bandits

manome / python-mab

Ralyhu / CMAB-CC

AlxBouras / NeuralRandUCB

JoelJa835 / MAB_Algorithms

sarthakmittal92 / multi-armed-bandits

doerlbh / dilemmaRL

philinemey / BSE-T3-RL

kfoofw / applied_learning_articles

Nicolivain / trustful-bandits

Nicolivain / RLD

doerlbh / BanditZoo

riccardodv / COOP-learning

babaniyi / Deep-contextual-bandits

MehranTaghian / prophet-inequlity-implementation
