Implement the DPE (Decentralized Parsimonious Exploration) algorithm from arXiv:1909.13079 #201

Naereen · 2019-10-06T16:43:31Z

The recent paper [An Optimal Algorithm in Multiplayer Multi-Armed Bandits, by Alexandre Proutière, Po-An Wang, arXiv:1909.13079] proposes an efficient algorithm for the stochastic case of multi-player MAB with collisions.
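As a reminder of the setting, here is a minimal sketch of the collision model, assuming Bernoulli arms and that every player colliding on an arm gets zero reward. It is not DPE and does not use the SMPyBandits API. The names `play_round` and `collision_regret` are hypothetical and only illustrate what any implementation would be compared on.

```python
import random


def play_round(means, choices, rng):
    """One round: a player gets a Bernoulli reward from its arm
    only if no other player picked the same arm (assumed model)."""
    counts = {}
    for arm in choices:
        counts[arm] = counts.get(arm, 0) + 1
    rewards = []
    for arm in choices:
        if counts[arm] > 1:
            rewards.append(0)  # collision: all players on this arm get nothing
        else:
            rewards.append(1 if rng.random() < means[arm] else 0)
    return rewards


def collision_regret(means, n_players, total_reward, horizon):
    """Regret against the oracle that puts the players on the
    n_players best arms, without collisions, for the whole horizon."""
    best = sum(sorted(means, reverse=True)[:n_players])
    return horizon * best - total_reward


if __name__ == "__main__":
    rng = random.Random(0)
    # Players 0 and 1 collide on arm 0, player 2 is alone on arm 2.
    print(play_round([0.9, 0.8, 0.1], [0, 0, 2], rng))
```

Any comparison between policies would accumulate `total_reward` over the rounds and report the regret as a function of the horizon.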

I should read it carefully,
implement their algorithms in SMPyBandits,
run my own comparison against RandTopM, MCTopM, Selfish, and SIC-MMAB,
and check and verify their claims (or disprove them?).

Note that their review of the current state of the art is not complete: they don't cite my article, and they omit the subsequent works of Avner & Mannor, August 2018 #139, Lugosi & Mehrabian, August 2018 #141, and Boursier & Perchet, September 2018 #145. They only cite the most recent paper by Boursier & Kaufmann & Perchet & Mehrabian, June 2019.

Naereen added the enhancement, new algo, and multi-player labels Oct 6, 2019

Naereen self-assigned this Oct 6, 2019
