multiarmed-bandits
Here are 55 public repositories matching this topic...
reward simulator for contextual bandits
-
Updated
Dec 20, 2018 - Perl
-
Updated
Mar 18, 2019 - Jupyter Notebook
A Comparative Evaluation of Active Learning Methods in Deep Recommendation
-
Updated
Jul 14, 2019 - Jupyter Notebook
Programming assignments of CS747 - Reinforcement Learning IIT-B
-
Updated
Nov 9, 2019 - Jupyter Notebook
Real-time decision tool for A/B-testing based on multi-armed bandit algorithm
-
Updated
Nov 29, 2019
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
-
Updated
Dec 11, 2019 - Python
Learning, Evaluation and Avoidance of Failure situations (LEAF) is a tool to that prevents failures in robot's task plan by learning from previous experience.
-
Updated
Feb 19, 2020 - Java
Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.
-
Updated
Feb 24, 2020 - Python
MBIT Big Data 2019-2020 Reinforced Learning (DC-03 TP-01)
-
Updated
Jun 23, 2020 - Jupyter Notebook
[Book] :- Andrea Lonza - Reinforcement Learning Algorithms with Python_ Learn, understand, and develop smart algorithms for addressing AI challenges-Packt Publishing (2019)
-
Updated
Jul 3, 2020 - Python
A repo contains my implementation and analysis of some well-known Reinforcement Learning problems and algorithms.
-
Updated
Jul 9, 2020 - Jupyter Notebook
Source code for blog post on Thompson Sampling
-
Updated
Sep 4, 2020 - JavaScript
-
Updated
Sep 8, 2020 - Python
This repository has all the codes and sources of various RL algorithms that I have implemented.
-
Updated
Sep 22, 2020 - Python
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing techniques applied to networks.
-
Updated
Oct 7, 2020 - Python
A beer recommendation system using multi-armed bandit approach to solve cold start problems
-
Updated
Oct 30, 2020 - Python
Thompson Sampling equipped with Goodness of Fit test based active change-point detection in Non-Stationary Bandit environment
-
Updated
Dec 3, 2020 - Python
Multi-Armed bandit problem to predict which arm will be most successful to bet upon using approaches like epsilon greedy, UCB, and Thompson sampling.
-
Updated
Mar 5, 2021
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing online learning techniques applied to networks.
-
Updated
Mar 10, 2021 - Python
Improve this page
Add a description, image, and links to the multiarmed-bandits topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the multiarmed-bandits topic, visit your repo's landing page and select "manage topics."