Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement the DPE (Decentralized Parsimonious Exploration) algorithm from arXiv:1909.13079 #201

Open
4 tasks
Naereen opened this issue Oct 6, 2019 · 0 comments
Assignees
Labels
enhancement I have to improve something which already works not too badly multi-player For multi-player bandits simulations new algo I have to implement a new algorithm! Yay!

Comments

@Naereen
Copy link
Member

Naereen commented Oct 6, 2019

The recent paper [An Optimal Algorithmin Multiplayer Multi-Armed Bandits, by Alexandre Proutière, Po-An Wang, arXiv:1909.13079] proposes an efficient algorithm for the stochastic case of multi-player MAB with collision.

  • I should read it carefully,
  • And implement in SMPyBandits their algorithms,
  • To do my own comparison against RandTopM and MCTopM, and Selfish, and SIC-MMAB,
  • And check and verify their claims. (or disprove them?),

Note that their review of the current state of the art is not complete, they don't quote my article, and they forgot the subsequent works of Avner & Mannor, August 2018 #139, Lugosi & Mehrabian, August 2018 #141, Bourse & Perchet, September 2018 #145. They only quoted the most recent paper by Boursier & Kaufmann & Perchet & Merhrabian, June 2019.

@Naereen Naereen added enhancement I have to improve something which already works not too badly new algo I have to implement a new algorithm! Yay! multi-player For multi-player bandits simulations labels Oct 6, 2019
@Naereen Naereen self-assigned this Oct 6, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement I have to improve something which already works not too badly multi-player For multi-player bandits simulations new algo I have to implement a new algorithm! Yay!
Development

No branches or pull requests

1 participant