Representative Arm Identification: A fixed confidence approach to identify cluster representatives

Gharat, Sarvesh; Yadav, Aniket; Karamchandani, Nikhil; Nair, Jayakrishnan

Computer Science > Machine Learning

arXiv:2408.14195 (cs)

[Submitted on 26 Aug 2024]

Title:Representative Arm Identification: A fixed confidence approach to identify cluster representatives

Authors:Sarvesh Gharat, Aniket Yadav, Nikhil Karamchandani, Jayakrishnan Nair

View PDF HTML (experimental)

Abstract:We study the representative arm identification (RAI) problem in the multi-armed bandits (MAB) framework, wherein we have a collection of arms, each associated with an unknown reward distribution. An underlying instance is defined by a partitioning of the arms into clusters of predefined sizes, such that for any $j > i$, all arms in cluster $i$ have a larger mean reward than those in cluster $j$. The goal in RAI is to reliably identify a certain prespecified number of arms from each cluster, while using as few arm pulls as possible. The RAI problem covers as special cases several well-studied MAB problems such as identifying the best arm or any $M$ out of the top $K$, as well as both full and coarse ranking. We start by providing an instance-dependent lower bound on the sample complexity of any feasible algorithm for this setting. We then propose two algorithms, based on the idea of confidence intervals, and provide high probability upper bounds on their sample complexity, which orderwise match the lower bound. Finally, we do an empirical comparison of both algorithms along with an LUCB-type alternative on both synthetic and real-world datasets, and demonstrate the superior performance of our proposed schemes in most cases.

Comments:	We analyse a clustered multi-armed bandit formulation, where the learning objective is to identify representative arms from each cluster, in a fixed confidence setting
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Probability (math.PR); Machine Learning (stat.ML)
Cite as:	arXiv:2408.14195 [cs.LG]
	(or arXiv:2408.14195v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.14195

Submission history

From: Sarvesh Gharat [view email]
[v1] Mon, 26 Aug 2024 11:47:52 UTC (547 KB)

Computer Science > Machine Learning

Title:Representative Arm Identification: A fixed confidence approach to identify cluster representatives

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Representative Arm Identification: A fixed confidence approach to identify cluster representatives

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators