mab

Here are 24 public repositories matching this topic...

avorozhtsov / shipit

Exploitation vs Exploration problem stated as A/B-testing with maximum profit per unit time.

continuous-testing ab-testing mab exploration-exploitation peaking

Updated Oct 4, 2023
Mathematica

VladMarianCimpeanu / OLA_project

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications

Updated Oct 30, 2022
Jupyter Notebook

sshaplygin / abcs

Adaptive bandit cache selection

golang statistics lru-cache arc-cache lfu-cache mab 2q-cache lfuda-cache

Updated Apr 14, 2024
Go

Implementation of Multi-Armed Bandit (MAB) algorithms UCB and Epsilon-Greedy. MAB is a class of problems in reinforcement learning where an agent learns to choose actions from a set of arms, each associated with an unknown reward distribution. UCB and Epsilon-Greedy are popular algorithms for solving MAB problems.

reinforcement-learning-algorithms ucb bandits mab e-greedy

Updated Mar 26, 2023
Python
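
As a rough illustration of the two algorithms named in the description above, here is a minimal sketch of Epsilon-Greedy and UCB1 on simulated Bernoulli arms. It is not taken from the listed repository: the function names (`epsilon_greedy`, `ucb1`, `run_bandit`), the Bernoulli reward model, and the default `epsilon=0.1` are all assumptions made for this example.

```python
import math
import random


def epsilon_greedy(counts, values, epsilon, rng):
    """With probability epsilon explore a random arm, else exploit the best mean."""
    if rng.random() < epsilon:
        return rng.randrange(len(counts))
    return max(range(len(counts)), key=lambda a: values[a])


def ucb1(counts, values, t):
    """Play each arm once, then maximize mean + sqrt(2 ln t / n)."""
    for arm, n in enumerate(counts):
        if n == 0:
            return arm
    return max(
        range(len(counts)),
        key=lambda a: values[a] + math.sqrt(2.0 * math.log(t) / counts[a]),
    )


def run_bandit(arm_probs, steps, policy, epsilon=0.1, seed=0):
    """Simulate Bernoulli arms (hypothetical setup); return (pull counts, total reward)."""
    rng = random.Random(seed)
    counts = [0] * len(arm_probs)    # number of pulls per arm
    values = [0.0] * len(arm_probs)  # running mean reward per arm
    total = 0.0
    for t in range(1, steps + 1):
        if policy == "ucb1":
            arm = ucb1(counts, values, t)
        else:
            arm = epsilon_greedy(counts, values, epsilon, rng)
        reward = 1.0 if rng.random() < arm_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the empirical mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
        total += reward
    return counts, total
```

Calling `run_bandit([0.2, 0.5, 0.8], 1000, "ucb1")` returns the per-arm pull counts and the accumulated reward. Passing any other policy string falls back to Epsilon-Greedy in this sketch.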

Bachfischer / COMP90051-StatML-Assignment-2

Source code for Assignment 2 of COMP90051 (Semester 2 2020)

ucb multi-armed-bandit mab

Updated Oct 21, 2020
Jupyter Notebook

jiseongHAN / reinforcement

My Little Reinforcement Learning

reinforcement-learning pytorch dqn reinforce ddqn mab ppo-pytorch

Updated Jul 13, 2021
Python

aijunbai / bandit

Algorithms for multi-armed bandit (MAB) problems

mab

Updated Oct 1, 2015
C++

tuhinsharma121 / pybandit-archive

A Python library for all popular multi-armed bandit algorithms.

optimization-algorithms mab

Updated Apr 28, 2023
Jupyter Notebook

pm3310 / mab-covid19

Multi-Armed-Bandit solutions on AWS to deliver Covid-19 test kits efficiently and effectively

python aws multi-armed-bandits mab sagemaker coronavirus covid-19

Updated Mar 25, 2020
Jupyter Notebook

DURUII / Replica-EUWR

🐯REPLICA of "Combinatorial Multi-Armed Bandit Based Unknown Worker Recruitment in Heterogeneous Crowdsensing"

crowdsourcing multi-armed-bandits online-learning crowdsensing mab mobile-crowdsensing worker-recruitment

Updated Dec 24, 2023
Jupyter Notebook

vmarchaud / ts-mab

Typescript implementation of a multi-armed bandit

typescript thompson-sampling mab

Updated May 17, 2020
TypeScript

abhinavcreed13 / Multi-armed-bandits-MAB

This project implements well-known MAB algorithms (EpsilonGreedy, UCB, BetaThompson, LinUCB, LinThompson) and evaluates them on the basis of their performance.

algorithms evaluation python3 multi-armed-bandits mab gridsearch