Max-Min Grouped Bandits

Wang, Zhenlin; Scarlett, Jonathan

Statistics > Machine Learning

arXiv:2111.08862 (stat)

[Submitted on 17 Nov 2021 (v1), last revised 15 Mar 2022 (this version, v2)]

Title:Max-Min Grouped Bandits

Authors:Zhenlin Wang, Jonathan Scarlett

View PDF

Abstract:In this paper, we introduce a multi-armed bandit problem termed max-min grouped bandits, in which the arms are arranged in possibly-overlapping groups, and the goal is to find the group whose worst arm has the highest mean reward. This problem is of interest in applications such as recommendation systems and resource allocation, and is also closely related to widely-studied robust optimization problems. We present two algorithms based successive elimination and robust optimization, and derive upper bounds on the number of samples to guarantee finding a max-min optimal or near-optimal group, as well as an algorithm-independent lower bound. We discuss the degree of tightness of our bounds in various cases of interest, and the difficulties in deriving uniformly tight bounds.

Comments:	AAAI 2022 paper + technical appendix (supplementary material), single-column format
Subjects:	Machine Learning (stat.ML); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2111.08862 [stat.ML]
	(or arXiv:2111.08862v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2111.08862

Submission history

From: Jonathan Scarlett [view email]
[v1] Wed, 17 Nov 2021 01:59:15 UTC (104 KB)
[v2] Tue, 15 Mar 2022 00:58:51 UTC (108 KB)

Full-text links:

Access Paper:

view license

Current browse context:

stat.ML

< prev | next >

new | recent | 2021-11

Change to browse by:

cs
cs.IT
cs.LG
math
math.IT
stat

References & Citations

export BibTeX citation

Statistics > Machine Learning

Title:Max-Min Grouped Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Max-Min Grouped Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators