bandit-learning

Star

Here are 33 public repositories matching this topic...

crenwick / Swiper

Star

🦊 A series of bandit algorithms in Swift

swift multi-arm-bandits bandit-learning multi-armed-bandits epsilon softmax bandit

Updated May 30, 2016
Swift

DenzilFrancisCrasta / bandit

Star

reinforcement-learning bandit-learning

Updated Mar 6, 2017
Python

juliakreutzer / bandit-cdec

Star

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

machine-translation bandit-learning cdec weak-feedback

Updated Mar 23, 2017
C++

florian / reinforcement-learning

Star

Implementing RL algorithms

machine-learning reinforcement-learning bandit-learning

Updated Mar 26, 2017
Jupyter Notebook

hartikainen / information-theoretic-bandit

Star

reinforcement-learning information-theory multi-arm-bandits bandit-learning perception-action-cycle k-armed-bandit information-to-go value-to-go

Updated Aug 1, 2017
Python

jonad / smartcab

Star

Train a SmartCab how to drive using reinforcement learning.

python reinforcement-learning pygame bandit-learning markov-decision-processes

Updated Nov 28, 2017
Jupyter Notebook

victor-iyi / policy-gradient

Star

A policy gradient approach to a multi-armed bandit problem

reinforcement-learning tensorflow policy-gradient bandit-learning multi-armed-bandits

Updated Nov 29, 2017
Jupyter Notebook

victor-iyi / contextual-bandit

Star

A Reinforcement Learning approach to a contextual bandit problem.

reinforcement-learning reinforcement-learning-algorithms bandit-learning markov-decision-processes contextual-bandit

Updated Dec 2, 2017
Jupyter Notebook

znreza / RL_Best_Presentation

Star

This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.

reinforcement-learning exploration reinforcement-learning-algorithms sarsa exploitation bandit-learning active-learning td-learning alphago model-based-rl bandit-algorithm passive-learning model-free sarsa-learning rl-vs-supervised-learning rl-vs-unsupervised-learning

Updated Dec 25, 2017

jpthanga / 10-Arm-Bandit

Star

Implementation of 10 Arm Bandit using RLGlue

reinforcement-learning cpp bandit-learning

Updated Jan 15, 2018
C

AntoineG92 / Online-Clustering-of-Bandits-ENSAE

Star

Based on Gentile-Li-Zapella article "Online Clustering of Bandits"

graph-algorithms clustering bandit-learning online-learning

Updated Jun 16, 2018
Jupyter Notebook

juliakreutzer / bandit-neuralmonkey

Star

Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).

machine-translation nmt bandit-learning weak-feedback neural-mt reinforce