Python utilities to compute a lower bound of the expected sample complexity to identify the best arm in a bandit model
This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.
Randomized Greedy Learning Under Full-bandit Feedback
Repository containing code for the course CS780: Deep Reinforcement Learning
Several multi-armed bandit strategies with additional holding option for smoother exploration.
Reinforcement Learning (COMP 579) Project
💫 Fast Julia implementation of various Kullback-Leibler divergences for 1D parametric distributions. 🏋 Also provides optimized code for kl-UCB indexes
A collection of Google Colab notebooks with educational material about bandits and their variants
An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms
Bandit and Evolutionary Algorithms using Python
🎩🤠 Some bandit algorithms in TypeScript
An open-source multi-armed bandit framework, written in Java, for quickly optimizing your website. It lets you apply several simple algorithms, including epsilon-Greedy, Softmax, and Upper Confidence Bound (UCB), and is easy to adapt for deployment on your own site.
A collection of implementations of the bandit problem.
This repo contains code for multi-armed bandit algorithm testing and local multiplayer competition.
This repository contains implementations of a wide variety of reinforcement learning projects covering bandit algorithms, MDPs, distributed RL, and deep RL, including both university coursework and projects pursued out of personal interest.
🐯 Replica of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"
Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)
Simple implementations of bandit algorithms in Python
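Several of the repositories above implement the epsilon-greedy strategy mentioned in the framework descriptions. A minimal, self-contained sketch of epsilon-greedy on a Bernoulli bandit is shown below; the arm means and parameters are illustrative assumptions, not taken from any listed repository:

```python
import random

def epsilon_greedy(true_means, epsilon=0.1, horizon=1000, seed=0):
    """Run epsilon-greedy on a Bernoulli bandit.

    With probability epsilon pull a uniformly random arm (explore);
    otherwise pull the arm with the highest empirical mean (exploit).
    Returns per-arm pull counts and empirical mean rewards.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms      # number of pulls per arm
    values = [0.0] * n_arms    # empirical mean reward per arm
    for _ in range(horizon):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                      # explore
        else:
            arm = max(range(n_arms), key=lambda a: values[a])  # exploit
        reward = 1.0 if rng.random() < true_means[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the running mean for the pulled arm.
        values[arm] += (reward - values[arm]) / counts[arm]
    return counts, values

# Illustrative three-armed bandit; arm 2 has the highest true mean.
counts, values = epsilon_greedy([0.2, 0.5, 0.8])
```

With a small epsilon, most pulls concentrate on whichever arm currently looks best, while the occasional random pull keeps estimates of the other arms from going stale.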