Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
Python application to set up and run streaming (contextual) bandit experiments.
COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) and strategies for the HCS context
A Julia package for running multi-armed bandit experiments
Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)
Experiment results using MAB algorithms in Yahoo! Front Page Today Module User Click Log dataset
VLAN Mac-address Authentication Manager
Multi-Player Bandits Revisited [L. Besson & É. Kaufmann]
This project implements well-known MAB algorithms (EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson) and evaluates them on the basis of their performance.
Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively
My Little Reinforcement Learning
The exploration-vs-exploitation problem framed as A/B testing that maximizes profit per unit time.
🐯REPLICA of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"
TypeScript implementation of a multi-armed bandit
Adaptive bandit cache selection
Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.
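The Epsilon-Greedy strategy described above can be sketched in a few lines: with probability epsilon the agent explores a random arm, otherwise it exploits the arm with the highest empirical mean so far. This is a minimal illustrative sketch, not code from any of the listed repositories; the function name and the Bernoulli reward model are assumptions made for the example.

```python
import random

def epsilon_greedy(true_means, epsilon=0.1, steps=10_000, seed=0):
    """Minimal epsilon-greedy bandit over Bernoulli arms (illustrative sketch).

    true_means: unknown-to-the-agent success probability of each arm.
    Returns the empirical mean estimates and the total reward collected.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms      # number of pulls per arm
    values = [0.0] * n_arms    # empirical mean reward per arm
    total_reward = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                       # explore
        else:
            arm = max(range(n_arms), key=values.__getitem__)  # exploit
        # Sample a Bernoulli reward from the chosen arm.
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the arm's empirical mean.
        values[arm] += (reward - values[arm]) / counts[arm]
        total_reward += reward
    return values, total_reward

estimates, total = epsilon_greedy([0.2, 0.5, 0.8])
```

With enough pulls, the estimate for the best arm converges toward its true mean, while the forced exploration keeps every arm's estimate from going stale; UCB replaces the random exploration with an optimism bonus based on each arm's pull count.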