🦊 A series of bandit algorithms in Swift
-
Updated
May 30, 2016 - Swift
🦊 A series of bandit algorithms in Swift
Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms
Implementing RL algorithms
Train a SmartCab how to drive using reinforcement learning.
A policy gradient approach to a multi-armed bandit problem
A Reinforcement Learning approach to a contextual bandit problem.
This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.
Implementation of 10 Arm Bandit using RLGlue
Based on Gentile-Li-Zapella article "Online Clustering of Bandits"
Bandit learning on top of Neural Monkey, an open-source tool for sequence learning in NLP built on TensorFlow. Bandit online learning objectives in branch bandits-acl (ACL17) and counterfactual learning objectives in branch acl-2018 (ACL18).
Bayesian bandits in Python3.
A checkers reinforcement learning AI, and all the tools needed to train it.
Contextual Bandits in R - simulation and evaluation of Multi-Armed Bandit Policies
Some visualizations of bandit algorithm outputs.
Tutorial on the Convolutional Tsetlin Machine
Aqui irei explicar como passar de cada nível do CTF Bandit fornecido pela Over The Wire
Add a description, image, and links to the bandit-learning topic page so that developers can more easily learn about it.
To associate your repository with the bandit-learning topic, visit your repo's landing page and select "manage topics."