upper-confidence-bound

Here are 20 public repositories matching this topic...

FaydSpeare / UCTGames

A collection of games accompanied by a generalised Monte Carlo Tree Search Artificial Intelligence in combination with Upper Confidence Bounds.

python tictactoe monte-carlo-tree-search artificial-intelligence-algorithms tictactoe-ai upper-confidence-bound connect-3d

Updated Feb 11, 2019
Python

sharmaroshan / Ads-Optimization

Star

Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.

data-science reinforcement-learning eda data-visualization thompson-sampling data-analysis beginner upper-confidence-bound

Updated May 24, 2019
Jupyter Notebook

singhgaurav2323 / reinforcement

Star

Reinforcement learning

reinforcement-learning thompson-sampling reinforcement-learning-algorithms upper-confidence-bounds reinforcement-learning-excercises upper-confidence-bound

Updated Jul 2, 2019
Python

Lazarus789 / Reinforcement-Models

Star

thompson-sampling upper-confidence-bound

Updated Aug 12, 2019

salimandre / Monte-Carlo-Tree-Search

Star

We implemented a Monte Carlo Tree Search (MCTS) from scratch and we successfully applied it to Tic-Tac-Toe game.

reinforcement-learning graphics mcts ucb monte-carlo-tree-search tic-tac-toe-game upper-confidence-bound

Updated Jul 9, 2020
Python

prabormukherjee / CTR_Testing

Star

Checking CTR(Click Thorugh Rate) of an ad using Thompson Sampling (Reinforcement Lrearning)

reinforcement-learning ml upper-confidence-bound ctr-testing

Updated Aug 12, 2020
Python

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

Star

We compare different policies for the checkers game using reinforcement learning algorithms.

python reinforcement-learning turtle-graphics ucb monte-carlo-tree-search checkers-game upper-confidence-bound mcts-algorithm

Updated Aug 24, 2020
Python

antoine-hochart / bandit_algo_evaluation

Star

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

Updated Dec 1, 2020
Python

taylorjg / k-armed-bandit

Star

Web visualisation of the k-armed bandit problem

react web-worker epsilon-greedy multi-armed-bandit webworker k-armed-bandit upper-confidence-bound

Updated Feb 6, 2021
JavaScript

liuanji / WU-UCT

Star

A novel parallel UCT algorithm with linear speedup and negligible performance loss.

parallel-algorithm monte-carlo-tree-search upper-confidence-bound upper-confidence-trees

Updated Apr 26, 2021
Python

krishnaaxo / Reinforcement-UCB-ThompsonSampling

Star

machine-learning reinforcement-learning thompson-sampling reinforcement-learning-algorithms upper-confidence-bounds upper-confidence-bound

Updated Jun 12, 2021
Jupyter Notebook

Nikronic / Machine-Learning-Models

Star

In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.

Updated Nov 7, 2021
Python

lionelsamrat10 / Machine-learning-a-to-z

Star

This repo contains code templates of all the machine learning algorithms that are used, like Regression, Classification, Clustering, etc.

python machine-learning natural-language-processing reinforcement-learning deep-learning random-forest clustering naive-bayes machine-learning-algorithms regression thompson-sampling neural-networks classification dimensionality-reduction logistic-regression convolutional-neural-networks predictive-analytics artificial-neural-network principal-component-analysis upper-confidence-bound

Updated Feb 17, 2022
Jupyter Notebook

aashish22bansal / Best-Ads-Predictor

Star

Predicting the best Ad from the given Ads.

reinforcement-learning thompson-sampling upper-confidence-bound

Updated May 26, 2022
Jupyter Notebook

simonZhou86 / Tr_LinUCB

Star

Code for the paper "Truncated LinUCB for Stochastic Linear Bandits"

linear-bandits contextual-bandits upper-confidence-bound

Updated Jun 2, 2023
Python

hritikb / Reinforcement-Learning-Algorithms

Star

reinforcement-learning q-learning grid-world epsilon-greedy sarsa dynamic-programming multi-armed-bandits policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning upper-confidence-bound gradient-bandit optimistic-inital-values greedy-policy

Updated Jun 29, 2023
Jupyter Notebook

Jayavathsan / MachineLearning-SciKitLearn

Star

Using SciKit Learn few Deep Learning Rules and Algorithms are implemented

reinforcement-learning clustering svm naive-bayes model-selection thompson-sampling xgboost classification dimensionality-reduction apriori k-means association-rules decision-tree principal-component-analysis linear-discriminant-analysis k-nearest-neighbor eclat upper-confidence-bound

Updated Aug 9, 2023
Jupyter Notebook

loraalex / LoBook

Star

LoRa@FIIT algorithms comparison using jupyter notebooks

iot analysis lora ucb adr upper-confidence-bound lorafiit adaptive-data-rate

Updated Dec 10, 2023
Jupyter Notebook

Retr0-code / Pong-RL

Star

Reinforcement learning used in the game of pong

cmake reinforcement-learning cpp q-learning ucb pong-game cpp20 boost-test upper-confidence-bound the-game-of-pong

Updated May 20, 2024
C++

Bin-Cao / Bgolearn

Star

A Bayesian global optimization package for material design ｜ Adaptive Learning | Active Learning

material-design materials knowledge-gradient adaptive-learning materials-science materials-informatics active-learning expected-improvement upper-confidence-bound opportunity-cost bayesian-global-optimization predictive-entropy-search mlmd probability-of-improvement least-confidence margin-sampling entropy-based-approach augmented-expected-improvement trail-path bgolearn

Updated May 21, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the upper-confidence-bound topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the upper-confidence-bound topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

upper-confidence-bound

Here are 20 public repositories matching this topic...

FaydSpeare / UCTGames

sharmaroshan / Ads-Optimization

singhgaurav2323 / reinforcement

Lazarus789 / Reinforcement-Models

salimandre / Monte-Carlo-Tree-Search

prabormukherjee / CTR_Testing

salimandre / Monte-Carlo-Tree-Search-for-checkers-game

antoine-hochart / bandit_algo_evaluation

taylorjg / k-armed-bandit

liuanji / WU-UCT

krishnaaxo / Reinforcement-UCB-ThompsonSampling

Nikronic / Machine-Learning-Models

lionelsamrat10 / Machine-learning-a-to-z

aashish22bansal / Best-Ads-Predictor

simonZhou86 / Tr_LinUCB

hritikb / Reinforcement-Learning-Algorithms

Jayavathsan / MachineLearning-SciKitLearn

loraalex / LoBook

Retr0-code / Pong-RL

Bin-Cao / Bgolearn

Improve this page

Add this topic to your repo