#

bandits

Here are 40 public repositories matching this topic...

tensorflow / agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

reinforcement-learning tensorflow dqn multi-armed-bandits bandits contextual-bandits rl-algorithms tf-agents

Updated Mar 20, 2024
Python

yfletberliac / rlss-2019

Materials for the Practical Sessions of the Reinforcement Learning Summer School 2019: Bandits, RL & Deep RL (PyTorch).

education tutorial school reinforcement-learning materials ipynb notebooks bandits google-colab

Updated Aug 21, 2019
Jupyter Notebook

banditml / banditml

A lightweight contextual bandit & reinforcement learning library designed to be used in production Python services.

reinforcement-learning pytorch personalization neural-networks bandits contextual-bandits

Updated Jun 4, 2021
Python

iheartradio / thomas

Another A/B test library

scala public functional-programming functional-reactive-programming ab-testing bayesian bandits bayesian-analysis bandit bandit-algorithm

Updated May 22, 2024
Scala

SC5 / bandits

machine-learning reinforcement-learning bandits contextual-bandit

Updated Nov 16, 2017
Python

kfoofw / applied_learning_articles

Collaborative project for documenting ML/DS learnings.

causal-inference bandits uplift-modelling

Updated May 5, 2022
Jupyter Notebook

doerlbh / BanditZoo

Python library of bandits and RL agents in different real-world environments

reinforcement-learning simulation bandits bandit bandit-algorithms

Updated Feb 21, 2022
Python

annieyan / Bandits-using-UCB-algorithm

Thompson Sampling for Bandits using UCB policy

reinforcement-learning thompson-sampling ucb bandits

Updated Jul 29, 2017
Python

gurbaaz27 / amazon-hackathon

machine-learning amazon hackathon recommendation-system bandits

Updated May 28, 2021
Jupyter Notebook

alxthm / rld-project

Play Rock, Paper, Scissors (Kaggle competition) with Reinforcement Learning: bandits, tabular Q-learning and PPO with LSTM.

q-learning rl bandits ppo rps-game

Updated Mar 2, 2021
Python

rohilrg / Online-Learning-Bandits-Reinforcement-Learning

An assignment for the implementation of Online Learning, Bandits and Reinforcement Learning

reinforcement-learning bandits online-passive-aggresive-algorithm

Updated Dec 18, 2018
Jupyter Notebook

doerlbh / ABaCoDE

Code for our ICDMW 2018 paper: "Contextual Bandit with Adaptive Feature Extraction".

reinforcement-learning feature-extraction icdm representation-learning bandits contextual-bandits nonstationary icdm2018

Updated Jun 15, 2020
MATLAB

rameshjes / RobotLearning

reinforcement-learning policy-gradient dynamic-programming markov-decision-processes bandits sarsa-lambda

Updated Aug 16, 2017
Jupyter Notebook

babaniyi / Deep-contextual-bandits

A benchmark to test decision-making algorithms for contextual-bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian Neural Networks and Thompson sampling), and a number of real and syntethic data problems exhibiting a diverse set of properties.

bandits bandit-algorithms multiarmed-bandits

Updated Jan 26, 2022
Python

BrianHung / random

random python notebooks (hopefully useful in future)

jupyter random bandits

Updated Jul 1, 2020
Jupyter Notebook

YRussac / WeightedLinearBandits

Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"

bandits non-stationary-environment neurips-2019

Updated Nov 14, 2019
Jupyter Notebook

TanguyUrvoy / pmlib

A python library for (finite) Partial Monitoring algorithms

machine-learning multi-armed-bandits bandits dueling-bandits partial-monitoring feedexp3 rex3

Updated Sep 12, 2017
Jupyter Notebook

Zaidtech / OverTheWire

This repo contains all the stuff I encountered while playing OverTheWire games.

cybersecurity bandits overthewire

Updated Dec 25, 2020

sarthakmittal92 / multi-armed-bandits

Repository for the course project done as part of CS-747 (Foundations of Intelligent & Learning Agents) course at IIT Bombay in Autumn 2022.

python thompson-sampling reinforcement-learning-algorithms ucb multi-armed-bandits bandits kl-ucb

Updated Oct 14, 2022
Python

anishacharya / Bandits-Online-Learning

Simple Implementations of Bandit Algorithms in python

bandit-learning multi-armed-bandits online-learning bandits bandit online-learning-algorithms bandit-algorithms online-learning-python

Updated Dec 2, 2021
Jupyter Notebook

Improve this page

Add a description, image, and links to the bandits topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bandits topic, visit your repo's landing page and select "manage topics."