Skip to content

armmn/Smoothness-Adaptive-Contextual-Bandits

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Smoothness-Adaptive Contextual Bandits

This repository imlements the simultions for showcasing the cost of smoothness misspecification in the non-parametric contextual bandits setting. We compare the performance of three policies: the equation policy that adapts to smoothness, equation proposed by Perchet and Rigollet (2013) which is initiated by the correct smoothness parameter equation, and equation which is initiated by the misspecified smoothness parameter equation.

References

Yonatn Gur, Ahmadreza Momeni, and Stefan Wager. Smoothness-Adaptive-Contextual-Bandits. 2020. [arxiv]

About

This repository contains the codes for the simulation in the paper "Smoothnees-Adaptive Contextual Bandits"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages