multiarmed-bandits

Learning, Evaluation and Avoidance of Failure situations (LEAF) is a tool to that prevents failures in robot's task plan by learning from previous experience.

robotics ontology learning-by-doing multiarmed-bandits

Updated Feb 19, 2020
Java

Bilkent-CYBORG / ACC-UCB

Star

Implementation of the Adaptive Contextual Combinatorial Upper Confidence Bound (ACC-UCB) algorithm for the contextual combinatorial volatile multi-armed bandit setting.

reinforcement-learning contextual-bandit multiarmed-bandits combinatorial-bandit

Updated Feb 24, 2020
Python

lesnofla / mbit-m08-dc03-reinforced

Star

MBIT Big Data 2019-2020 Reinforced Learning (DC-03 TP-01)

reinforcement-learning catch q-learning gym keras-rl multiarmed-bandits

Updated Jun 23, 2020
Jupyter Notebook

prakHr / Reinforcement-Learning-Book

Star

[Book] :- Andrea Lonza - Reinforcement Learning Algorithms with Python_ Learn, understand, and develop smart algorithms for addressing AI challenges-Packt Publishing (2019)

Updated Jul 3, 2020
Python

vuk119 / RL

Star

A repo contains my implementation and analysis of some well-known Reinforcement Learning problems and algorithms.

jupyter-notebook q-learning python3 reinforcement-learning-algorithms multiarmed-bandits ddqn-pyotrch

Updated Jul 9, 2020
Jupyter Notebook

GjjvdBurg / ThompsonSampling

Star

Source code for blog post on Thompson Sampling

thompson-sampling multi-armed-bandit bandit-algorithms multiarmed-bandits

Updated Sep 4, 2020
JavaScript

R4j4n / Maximizing-Revenue-of-an-Online-Retail-Business

Star

thompson-sampling multiarmed-bandits thompson-algorithm revenue-systems

Updated Sep 8, 2020
Python

Sushant-ctrl / RL-IMPLEMENTATIONS

Star

This repository has all the codes and sources of various RL algorithms that I have implemented.

dqn rl temporal-differencing-learning multiarmed-bandits montecarlomethod tabular-rl

Updated Sep 22, 2020
Python

StivenMetaj / Data_Intelligence_Applications_Exam_Project

Star

Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing techniques applied to networks.

python pricing thompson-sampling social-influence data-intelligence multiarmed-bandits

Updated Oct 7, 2020
Python

paulozip / beer-recommender-mab

Star

A beer recommendation system using multi-armed bandit approach to solve cold start problems

python recommendation-system multi-armed-bandits multiarmed-bandits

Updated Oct 30, 2020
Python

hardhik-99 / Thompsom_Sampling_GoF

Star

Thompson Sampling equipped with Goodness of Fit test based active change-point detection in Non-Stationary Bandit environment

reinforcement-learning thompson-sampling goodness-of-fit multiarmed-bandits

Updated Dec 3, 2020
Python

robinbeura / Reinforcement-Learning

Star

Multi-Armed bandit problem to predict which arm will be most successful to bet upon using approaches like epsilon greedy, UCB, and Thompson sampling.

reinforcement-learning multiarmed-bandits

Updated Mar 5, 2021

SamueleMeta / data-intelligence-applications

Star

Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing online learning techniques applied to networks.

graphs pricing thompson-sampling greedy-algorithm online-learning social-influence multiarmed-bandits

Updated Mar 10, 2021
Python

Improve this page

Add a description, image, and links to the multiarmed-bandits topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multiarmed-bandits topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multiarmed-bandits

Here are 55 public repositories matching this topic...

rudojaksa / evgen

rudojaksa / reward

aayn / multi-armed-bandits

k9luo / Deep-Preference-Elicitation

mohit-madan / CS747-assignments

seojtix / shiva

alison-carrera / onn

Nath-R / LEAF

Bilkent-CYBORG / ACC-UCB

lesnofla / mbit-m08-dc03-reinforced

prakHr / Reinforcement-Learning-Book

vuk119 / RL

GjjvdBurg / ThompsonSampling

R4j4n / Maximizing-Revenue-of-an-Online-Retail-Business

Sushant-ctrl / RL-IMPLEMENTATIONS

StivenMetaj / Data_Intelligence_Applications_Exam_Project

paulozip / beer-recommender-mab

hardhik-99 / Thompsom_Sampling_GoF

robinbeura / Reinforcement-Learning

SamueleMeta / data-intelligence-applications

Improve this page

Add this topic to your repo