ucb1

Star

Here are 14 public repositories matching this topic...

Twice22 / Reinforcement-Learning

Star

My reports for the reinforcement learning class given at the ENS

reinforcement-learning policy-gradient reinforce policy-iteration value-iteration ucb1

Updated Jan 16, 2018
Jupyter Notebook

akshaykhadse / reinforcement-learning

Star

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

mykeels / multi-armed-bandit-problem

Star

An implementation of solvers for the multi-armed-bandit-problem in JavaScript.

thompson-sampling epsilon-greedy multi-armed-bandit ucb1

Updated Apr 25, 2019
JavaScript

alextanhongpin / go-bandit

Sponsor

Star

Multi-Armed Bandit (MAB) algorithm implementation in go

go ucb1 mulit-arm-bandit greedy-epsilon

Updated Nov 25, 2019
Go

sanxore / py-mcts

Star

Python implementation of Monte Carlo Tree Search

mcts uct monte-carlo-tree-search ucb1

Updated Jan 4, 2020
Python

EmanuelAlogna / Data-Intelligence-Applications

Star

Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan

reinforcement-learning social-network pricing thompson-sampling reinforcement-learning-algorithms multi-armed-bandit ucb1 social-influence

Updated Feb 12, 2020
Python

viswanath57 / Bandit-Algorithms

Star

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

Updated Apr 5, 2021
Jupyter Notebook

HoangTran0410 / Reversi-mcts

Star

Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.

board-game machine-learning csharp bitboard mcts monte-carlo-tree-search othello-game reversi-game ucb1 othello-ai mcts-algorithm

Updated May 31, 2021
C#

Nikita-Kudrin / funcorp-bandit

Star

REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin

kotlin spring-boot ucb1

Updated Jan 29, 2022
Kotlin

kochlisGit / Reinforcement-Learning-Algorithms

Star

This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.

python reinforcement-learning monte-carlo openai-gym q-learning policy rl-agents epsilon-greedy dynamic-programming markov-chains approximation-algorithms ucb1 q-lambda exploration-exploitation thomson-sampling frozen-lake multi-bandit-army

Updated Feb 15, 2022
Python

zzmtsvv / ml_sandbox

Star

regression calibration gan style-transfer classification mlp self-organizing-map knearest-neighbor-algorithm gradient-boosting variational-autoencoder cyclegan ucb1 spectral-normalization vq-vae cnn-visualization self-normalizing-neural-networks diffusion-models

Updated Sep 3, 2022
Jupyter Notebook

VladMarianCimpeanu / OLA_project

Star

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications