Multi-Player Bandits Revisited

Besson, Lilian; Kaufmann, Emilie

Statistics > Machine Learning

arXiv:1711.02317 (stat)

[Submitted on 7 Nov 2017 (v1), last revised 13 Mar 2018 (this version, v3)]

Title:Multi-Player Bandits Revisited

Authors:Lilian Besson (IETR, SEQUEL), Emilie Kaufmann (CRIStAL, SEQUEL)

View PDF

Abstract:Multi-player Multi-Armed Bandits (MAB) have been extensively studied in the literature, motivated by applications to Cognitive Radio systems. Driven by such applications as well, we motivate the introduction of several levels of feedback for multi-player MAB algorithms. Most existing work assume that sensing information is available to the algorithm. Under this assumption, we improve the state-of-the-art lower bound for the regret of any decentralized algorithms and introduce two algorithms, RandTopM and MCTopM, that are shown to empirically outperform existing algorithms. Moreover, we provide strong theoretical guarantees for these algorithms, including a notion of asymptotic optimality in terms of the number of selections of bad arms. We then introduce a promising heuristic, called Selfish, that can operate without sensing information, which is crucial for emerging applications to Internet of Things networks. We investigate the empirical performance of this algorithm and provide some first theoretical elements for the understanding of its behavior.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1711.02317 [stat.ML]
	(or arXiv:1711.02317v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1711.02317
Journal reference:	Algorithmic Learning Theory, Apr 2018, Lanzarote, Spain. 2018, http://www.cs.cornell.edu/conferences/alt2018/

Submission history

From: Lilian Besson [view email] [via CCSD proxy]
[v1] Tue, 7 Nov 2017 07:10:47 UTC (639 KB)
[v2] Wed, 7 Mar 2018 14:41:15 UTC (624 KB)
[v3] Tue, 13 Mar 2018 07:30:42 UTC (624 KB)

Statistics > Machine Learning

Title:Multi-Player Bandits Revisited

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Multi-Player Bandits Revisited

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators