reward simulator for contextual bandits
-
Updated
Dec 20, 2018 - Perl
reward simulator for contextual bandits
This is an A/B testing project that was made to see if a new version of a sign up button in a website is better than current one.
Programming assignments of CS747 - Reinforcement Learning IIT-B
[Python] 4 multi-armed bandit algorithms are implemented to determine which one can most effectively determine the best website configuration that maximise signups.
MBIT Big Data 2019-2020 Reinforced Learning (DC-03 TP-01)
A julia package to compute Gittins Indices for Multi Armed Bandits
Real-time decision tool for A/B-testing based on multi-armed bandit algorithm
Multi-armed bandit problem in Reinforcement learning
Reinforcement Learning basic tasks
Applying anomaly detection methods on Multi-Armed Bandit problems
Profiling Vehicles for Improved Small Cell Beam-Vehicle Pairing Using Multi-Armed Bandit
A repo contains my implementation and analysis of some well-known Reinforcement Learning problems and algorithms.
Analysis of various multi armed bandit algorithms over normal and heavy-tailed distributions.
A slightly modified UCB algorithm for Multi Armed Bandits (MABs) where we have as a given that each bandit rewards with a Poisson distribution
Multi-Armed bandit problem to predict which arm will be most successful to bet upon using approaches like epsilon greedy, UCB, and Thompson sampling.
Contains all code and course work for a module on reinforcement learning.
Classical reinforcement learning problem on Go
Some examples of methods for solving multi-armed bandits problems
Add a description, image, and links to the multiarmed-bandits topic page so that developers can more easily learn about it.
To associate your repository with the multiarmed-bandits topic, visit your repo's landing page and select "manage topics."