bandit-learning

Decoder, aligner, and model optimizer for statistical machine translation and other structured prediction models based on (mostly) context-free formalisms

machine-translation bandit-learning cdec weak-feedback

Updated Mar 23, 2017
C++

vwang0 / causal_inference

Star

experiment simulation ab-testing bandit-learning bandit-algorithm

Updated Oct 28, 2020
Jupyter Notebook

SFV-CORE / Bandit_OverTheWire

Star

Aqui irei explicar como passar de cada nível do CTF Bandit fornecido pela Over The Wire

linux bandit-learning ctf-solutions ctf-challenges bandit

Updated Apr 1, 2021

ad0x99 / linux-4-fun

Star

My Linux Notes

linux bandit-learning

Updated Jun 3, 2021

anishacharya / Bandits-Online-Learning

Star

Simple Implementations of Bandit Algorithms in python

bandit-learning multi-armed-bandits online-learning bandits bandit online-learning-algorithms bandit-algorithms online-learning-python

Updated Dec 2, 2021
Jupyter Notebook

crenwick / Swiper

Star

🦊 A series of bandit algorithms in Swift

swift multi-arm-bandits bandit-learning multi-armed-bandits epsilon softmax bandit

Updated May 30, 2016
Swift

DenzilFrancisCrasta / bandit

Star

reinforcement-learning bandit-learning

Updated Mar 6, 2017
Python

fouratifares / RGL

Star

Randomized Greedy Learning Under Full-bandit Feedback

agent machine-learning reinforcement-learning machine-learning-algorithms reinforcement-learning-algorithms machinelearning bandit-learning submodular-optimization submodularity bandit-algorithms

Updated Jan 22, 2024
Python

znreza / RL_Best_Presentation

Star

This presentation contains very precise yet detailed explanation of concepts of a very interesting topic -- Reinforcement Learning.

reinforcement-learning exploration reinforcement-learning-algorithms sarsa exploitation bandit-learning active-learning td-learning alphago model-based-rl bandit-algorithm passive-learning model-free sarsa-learning rl-vs-supervised-learning rl-vs-unsupervised-learning

Updated Dec 25, 2017

victor-iyi / contextual-bandit

Star

A Reinforcement Learning approach to a contextual bandit problem.

reinforcement-learning reinforcement-learning-algorithms bandit-learning markov-decision-processes contextual-bandit

Updated Dec 2, 2017
Jupyter Notebook

0x65-e / Stats-115

Star

Homework Code for UCLA STATS 115 (Probabilistic Decision Making) Fall 22 Offering

reinforcement-learning python3 expectation-maximization reinforcement-learning-algorithms bandit-learning markov-decision-processes value-iteration decision-making-under-uncertainty bandit-algorithms decision-making-algorithms

Updated Nov 25, 2022
Python

AntoineG92 / Online-Clustering-of-Bandits-ENSAE

Star

Based on Gentile-Li-Zapella article "Online Clustering of Bandits"

graph-algorithms clustering bandit-learning online-learning

Updated Jun 16, 2018
Jupyter Notebook

florian / reinforcement-learning

Star

Implementing RL algorithms

machine-learning reinforcement-learning bandit-learning

Updated Mar 26, 2017
Jupyter Notebook

Improve this page

Add a description, image, and links to the bandit-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the bandit-learning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bandit-learning

Here are 33 public repositories matching this topic...

victor-iyi / policy-gradient

shashankp914 / Over-the-wire-wargames-Solutions

zeroinfiniti / bandit-wargames

hartikainen / information-theoretic-bandit

jonad / smartcab

vitorhugo13 / feup-mssi

jpthanga / 10-Arm-Bandit

juliakreutzer / bandit-cdec

vwang0 / causal_inference

SFV-CORE / Bandit_OverTheWire

ad0x99 / linux-4-fun

anishacharya / Bandits-Online-Learning

crenwick / Swiper

DenzilFrancisCrasta / bandit

fouratifares / RGL

znreza / RL_Best_Presentation

victor-iyi / contextual-bandit

0x65-e / Stats-115

AntoineG92 / Online-Clustering-of-Bandits-ENSAE

florian / reinforcement-learning

Improve this page

Add this topic to your repo