Implement RR-SW-UCB# and SW-DLP from paper arXiv:1812.05165 #169

Naereen · 2018-12-17T12:38:53Z

For non-stationary multi-player bandits, the following reference introduces the awesome RR-SW-UCB# and SW-DLP algorithms: ["On Distributed Multi-player Multi-Armed Bandit Problems in Abruptly Changing Environment", by Lai Wei, Vaibhav Srivastava, 2018, arXiv:1812.05165]. Cf. #183

I need to:

Write the RR-SW-UCB# algorithm!
Write the SW-DLP algorithm!
Implement them in a very generic way, as children of BaseWrapperPolicy, so any index policy can directly be used and not just UCB!
Test them on simple problems, and check what the authors claim in their paper.
Compare them with Selfish+SW-UCB# or MCTopM+SW-UCB# (or any other efficient non-stationary bandit algorithm).
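
To make the "generic index policy + round-robin" idea above more concrete, here is a minimal sketch. It is NOT the algorithm from the paper and it does not use the real BaseWrapperPolicy API: the class `SlidingWindowUCB`, the function `round_robin_choice`, and the parameters `window` and `alpha` are all hypothetical names chosen for illustration. It uses a fixed-size window (the paper's window rule may differ) and assumes every player somehow shares the same ranking of arms, which a real distributed algorithm would have to guarantee.

```python
import math
from collections import deque


class SlidingWindowUCB:
    """Hypothetical sketch: a UCB index computed over the last `window` plays only."""

    def __init__(self, nb_arms, window=100, alpha=1.0):
        self.nb_arms = nb_arms
        self.alpha = alpha  # exploration constant (assumed, not from the paper)
        # Each entry is an (arm, reward) pair; the deque drops the oldest one itself.
        self.history = deque(maxlen=window)
        self.t = 0

    def update(self, arm, reward):
        """Record one observation and advance the time counter."""
        self.t += 1
        self.history.append((arm, reward))

    def index(self, arm):
        """Empirical mean plus exploration bonus, using in-window samples only."""
        rewards = [r for (a, r) in self.history if a == arm]
        if not rewards:
            return float("inf")  # arm not seen in the window: explore it first
        horizon = max(min(self.t, self.history.maxlen), 2)
        mean = sum(rewards) / len(rewards)
        bonus = math.sqrt(self.alpha * math.log(horizon) / len(rewards))
        return mean + bonus

    def ranked_arms(self):
        """Arms sorted by decreasing index (ties keep the lower arm id first)."""
        return sorted(range(self.nb_arms), key=self.index, reverse=True)


def round_robin_choice(policy, player_id, nb_players, t):
    """Player `player_id` takes a rotating slot among the estimated top arms."""
    top = policy.ranked_arms()[:nb_players]
    return top[(player_id + t) % nb_players]
```

Any other index could be swapped in for `index`, which is the point of writing the wrapper generically. With a shared ranking, two players never pick the same arm in the same round, because their slots are offset by their `player_id`.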

Naereen added the "new algo", "multi-player" and "non-stationary" labels Dec 17, 2018

Naereen self-assigned this Dec 17, 2018

This was referenced Feb 28, 2019

Add a clean support for piecewise stationary multi-player MAB problems #183

Open

Implement the C&P algorithm from arXiv:1902.08036 #184

Open
