This project encompasses my master's thesis, titled "Audio-Visual Attention Modeling via Reinforcement Learning".
Code for the paper "Truncated LinUCB for Stochastic Linear Bandits"
Recommendation using LinUCB algorithm
Easily Score & Rank JSON-Encodable Objects with ML
A simple pure-Python framework for dealing with contextual multi-armed bandit problems
Contextual Bandit Engine
Predicting the outcome of League of Legends E-Sports matches using reinforcement learning contextual bandits.
Multi-objective Stochastic Linear Bandits
Challenge of solving a personalization task with RL methods.
WIP: A library and AWS SDK for non-contextual and contextual multi-armed bandit (MAB) algorithms for multiple use cases
Repo for the course CSC2558 "Intelligent Adaptive Interventions": a project on nonstationary contextual bandits.
Source code for the numerical experiments presented in the paper "On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks".
Proof of concept for a recommender system for Yelp, using bandit algorithms.
Awesome list about anything related to bandit problems
Code for our AJCAI 2020 paper: "Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward".
This project aims to implement algorithms from various papers on online learning
Official implementation of "On Optimal Private Online Stochastic Optimization and High Dimensional Decision Making"
Bandit code contributed by Louie Hoang at MSR.
Experimental results of MAB algorithms on the Yahoo! Front Page Today Module User Click Log dataset