Implement RR-SW-UCB# and SW-DLP from paper arXiv:1812.05165 #169
Labels
multi-player
For multi-player bandits simulations
new algo
I have to implement a new algorithm! Yay!
non-stationary
For non-stationary bandits simulations
For non-stationary multi-player bandits, the following reference introduces the awesome RR-SW-UCB# and SW-DLP algorithms: ["On Distributed Multi-player Multi-Armed Bandit Problems in Abruptly Changing Environment", by Lai Wei, Vaibhav Srivastava, 2018, arXiv:1812.05165]. Cf. #183
I need to:
BaseWrapperPolicy, so any index policy can directly be used and not just UCB!
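To make the "any index policy, not just UCB" idea concrete, here is a minimal sketch of a sliding-window wrapper that delegates the index computation to a pluggable function. This is neither RR-SW-UCB# nor SW-DLP from the paper, and it does not use the real BaseWrapperPolicy API: the class `SlidingWindowWrapper`, its methods `get_reward` and `choice`, and the helper `ucb_index` are all hypothetical names chosen for illustration.

```python
import math
from collections import deque


def ucb_index(mean, pulls, t):
    """Classic UCB1 index (illustrative); any function with this signature works."""
    if pulls == 0:
        return float("inf")  # force exploration of arms unseen in the window
    return mean + math.sqrt(2.0 * math.log(t) / pulls)


class SlidingWindowWrapper:
    """Hypothetical wrapper: feeds windowed statistics to any index function."""

    def __init__(self, nb_arms, index_fn, window):
        self.nb_arms = nb_arms
        self.index_fn = index_fn
        # Only the last `window` (arm, reward) pairs are kept; older ones are forgotten.
        self.history = deque(maxlen=window)

    def get_reward(self, arm, reward):
        self.history.append((arm, reward))

    def choice(self):
        # Recompute per-arm counts and sums from the current window only.
        pulls = [0] * self.nb_arms
        sums = [0.0] * self.nb_arms
        for arm, reward in self.history:
            pulls[arm] += 1
            sums[arm] += reward
        t = max(1, len(self.history))
        indexes = [
            self.index_fn(sums[a] / pulls[a] if pulls[a] else 0.0, pulls[a], t)
            for a in range(self.nb_arms)
        ]
        # Play the arm with the largest index (ties go to the lowest arm id).
        return max(range(self.nb_arms), key=indexes.__getitem__)
```

Swapping `ucb_index` for another index function (a KL-UCB-style index, say) would not require touching the wrapper, which is the point of the design. The multi-player parts (round-robin scheduling, prioritization) are deliberately left out of this sketch.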