Multi Armed Bandits implementation using the Jester Dataset
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Enhancing Warfarin Dosage Prediction using Ensemble Sampling
A Ruby client for the PreferredPictures API.
The GitHub repository for "Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo", AISTATS 2024.
Sample code written in R that compares Thompson Sampling and UCB on three available arms, each with rewards drawn from a Bernoulli distribution.
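The repository above implements the comparison in R; the UCB side of it can be sketched in Python roughly as follows. This is a minimal sketch of the standard UCB1 rule, not the repository's actual code, and the arm probabilities are illustrative assumptions.

```python
import math
import random

def ucb1(true_probs, n_rounds=5000, seed=0):
    """UCB1 on Bernoulli arms: pull each arm once, then repeatedly
    play the arm with the largest upper confidence bound."""
    rng = random.Random(seed)
    k = len(true_probs)
    counts = [0] * k    # pulls per arm
    sums = [0.0] * k    # cumulative reward per arm
    total = 0
    for t in range(1, n_rounds + 1):
        if t <= k:
            arm = t - 1  # initialization round: try every arm once
        else:
            # empirical mean + exploration bonus sqrt(2 ln t / n_i)
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = 1 if rng.random() < true_probs[arm] else 0
        counts[arm] += 1
        sums[arm] += reward
        total += reward
    return total, counts
```

With three hypothetical arms such as `[0.2, 0.5, 0.8]`, the pull counts concentrate on the best arm as the exploration bonus of the others shrinks.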
A PHP client for the PreferredPictures API.
Code for Policy Optimization as Online Learning with Mediator Feedback
🖱 Figure out which ad has the highest click rate
Continuation of my machine learning work, organized by subject, starting with Evaluating Classification Model Performance
Multiarm Bandits on Kafka Streams
Foundations Of Intelligent Learning Agents (FILA) Assignments
My programs during CS747 (Foundations of Intelligent and Learning Agents) Autumn 2021-22
Package to implement the Thompson Sampling algorithm.
The multi-armed bandit problem is one of the classical reinforcement learning problems; it captures the tension between an agent's exploration and exploitation.
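The exploration–exploitation trade-off described above is what Thompson Sampling resolves probabilistically: each arm keeps a Beta posterior over its success rate, and at every step the arm with the highest posterior sample is played. A minimal sketch for Bernoulli arms (the arm probabilities and prior are illustrative assumptions, not from any repository listed here):

```python
import random

def thompson_sampling(true_probs, n_rounds=5000, seed=0):
    """Thompson Sampling for Bernoulli arms with Beta(1, 1) priors."""
    rng = random.Random(seed)
    k = len(true_probs)
    alphas = [1] * k  # posterior successes + 1
    betas = [1] * k   # posterior failures + 1
    total = 0
    for _ in range(n_rounds):
        # Draw one sample per arm from its Beta posterior; play the argmax.
        samples = [rng.betavariate(alphas[i], betas[i]) for i in range(k)]
        arm = max(range(k), key=lambda i: samples[i])
        reward = 1 if rng.random() < true_probs[arm] else 0
        total += reward
        if reward:
            alphas[arm] += 1
        else:
            betas[arm] += 1
    return total, alphas, betas
```

Early on, wide posteriors make every arm's sample plausible (exploration); as evidence accumulates, the posteriors sharpen and play concentrates on the best arm (exploitation).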
The purpose of this study is to predict, from user click data on a set of fictitious ads, which ad customers will prefer most.
EENE Navigation Bandit Simulator
Variety of Multi-Arm Bandit (MAB) algorithms using classic and advanced strategies, including tools for experiments and simulations in stationary and nonstationary environments