#

multi-armed-bandit

Here are 116 public repositories matching this topic...

KaranAnchan / 10_Arm_Testbed

Explore the 10-Arm Testbed Simulation! 🎲 Utilize Python to test various ε-greedy strategies in a reinforcement learning environment. Visualize and compare agents' performance as they balance exploration and exploitation. Perfect for learners and enthusiasts! 🚀📊

python machine-learning reinforcement-learning decision-making epsilon-greedy multi-armed-bandit exploration-exploitation

Updated May 27, 2024
Python

improve-ai / tracker-trainer

Contextual Multi-Armed Bandit Reward Tracker & Model Trainer

python aws machine-learning reinforcement-learning ai aws-lambda serverless ml personalization xgboost serverless-framework parquet ab-testing recommender-system decision-trees multi-armed-bandit contextual-bandits improve-ai

Updated May 21, 2024
Python

improve-ai / java-ranker

Easily Score & Rank JSON-Encodable Objects with ML

android kotlin java machine-learning reinforcement-learning ai personalization xgboost ab-testing recommender-system multi-armed-bandit multivariate-testing contextual-bandits improve-ai

Updated May 21, 2024
Java

haoyangzheng1996 / ts_ulmc

The GitHub repository for "Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo", AISTATS 2024.

monte-carlo thompson-sampling multi-armed-bandit langevin-dynamics exploration-exploitation

Updated May 19, 2024
Python

mab

stitchfix / mab

Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.

go golang data-science reinforcement-learning thompson-sampling experimentation multi-armed-bandits multi-armed-bandit thompson multiarmed-bandits

Updated May 14, 2024
Go

Alanjamlu34 / Multi-Armed-Bandit--Adaptive-epsilon-greedy-

Repository tugas akhir tentang Multi-Armed Bandit

machine-learning reinforcement-learning ab-testing multi-armed-bandits multi-armed-bandit

Updated May 9, 2024
Jupyter Notebook

mweglowski / bandit_problem_simulator

🦾🤖 Visual and interactive simulator of multi-armed bandit problem.

javascript css html machine-learning reinforcement-learning algorithms reactjs multi-armed-bandit tailwindcss

Updated May 7, 2024
JavaScript

jacksonpradolima / coleman4hcs

COLEMAN (Combinatorial VOlatiLE Multi-Armed BANdit) - and strategies for HCS context

tcp continuous-integration ci multi-armed-bandit hcs coleman mab test-case-prioritization tcpci highly-configurable-system

Updated Jun 3, 2024
Jupyter Notebook

SMPyBandits

SMPyBandits / SMPyBandits

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

python open-source research internet-of-things simulations multi-arm-bandits multi-armed-bandit learning-theory bandit-algorithms cognitive-radio

Updated Apr 30, 2024
Jupyter Notebook

jakemaz66 / RecoveringSleepingBandit

A Novel Multi-Arm Bandit Optimization Implementation using reinforcement learning in Python for selecting Notifications.

reinforcement-learning optimization duolingo multi-armed-bandit

Updated Apr 26, 2024
Python

saeedghoorchian / D-LinTS-RP

Experiments for paper "Bayesian Linear Bandits for Large-Scale Recommender Systems"

bayesian-methods recommender-system multi-armed-bandit high-dimensional-decision-space

Updated Apr 18, 2024
Jupyter Notebook

FlynnOwen / multi-armed-bandits

Multi-Armed Bandit method of accurately estimating the largest parameter out of a set of candidates.

python reinforcement-learning machine multi-armed-bandits multi-armed-bandit

Updated Apr 6, 2024
Python

taoensso / touchstone

Simple A/B testing library for Clojure

clojure epl multi-armed-bandit split-testing taoensso engagement-testing

Updated Mar 19, 2024
Clojure

NicoHerrig95 / Multi-armed-Bandit-RL

Easy-to-use library for multi-armed bandit problems.

reinforcement-learning multi-armed-bandit bandit

Updated Mar 12, 2024
Jupyter Notebook

prusrafal / Click-Through-Rate-Prediction-Model

This repository is for a Decision Making Aarhus University Course assignment, focusing on using Multi-Armed Bandit algorithms, specifically the epsilon-greedy algorithm, for optimizing click-through rates in digital advertising by balancing the exploration of new ads and the exploitation of successful ones.

data-science r multi-armed-bandit modelling-databases

Updated Mar 10, 2024
R

willyfh / multi-armed-bandit

A classic reinforcement learning problem.

reinforcement-learning artificial-intelligence multi-armed-bandit

Updated Feb 17, 2024
Python

aayush97 / metadata-based-multiarmed-bandits

MetaHierTS is a novel recommendation system algorithm aimed at enhancing user experiences in online marketing. This algorithm focuses on leveraging metadata and similarities between tasks to optimize decision-making in a multi-task Multi-Armed Bandit (MAB) environment.

bayesian-methods recommender-system multi-armed-bandit

Updated Jan 2, 2024
Python

saeedghoorchian / NCC-Bandits

Experiments for paper "Online Learning with Costly Features in Non-stationary Environments"

multi-armed-bandit concept-drift contextual-bandits non-stationary-environment costly-features

Updated Dec 20, 2023
Jupyter Notebook

DURUII / Replica-AUCB

🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

multi-armed-bandit bandits mab cmab bandit-algorithms aution aucb

Updated Dec 17, 2023
Python

hughyi / Research-Project

Prof. Jungmin So - spring '23

multi-armed-bandit cr cognitive-radio federated-learning-algorithm cognitive-radio-network

Updated Dec 13, 2023
Python

Improve this page

Add a description, image, and links to the multi-armed-bandit topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-armed-bandit topic, visit your repo's landing page and select "manage topics."