Hierarchical Bayesian Bandits

Hong, Joey; Kveton, Branislav; Zaheer, Manzil; Ghavamzadeh, Mohammad

arXiv:2111.06929 (cs)

[Submitted on 12 Nov 2021 (v1), last revised 5 Mar 2022 (this version, v2)]

Abstract: Meta-, multi-task, and federated learning can all be viewed as solving similar tasks drawn from a distribution that reflects task similarities. We provide a unified view of all these problems as learning to act in a hierarchical Bayesian bandit. We propose and analyze a natural hierarchical Thompson sampling algorithm (HierTS) for this class of problems. Our regret bounds hold for many variants of the problems, including when the tasks are solved sequentially or in parallel, and show that the regret decreases with a more informative prior. Our proofs rely on a novel total variance decomposition that can be applied beyond our models. Our theory is complemented by experiments, which show that the hierarchy helps with knowledge sharing among the tasks. This confirms that hierarchical Bayesian bandits are a universal and statistically efficient tool for learning to act with similar bandit tasks.
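
To make the idea of hierarchical Thompson sampling concrete, the following is a minimal sketch of two-stage posterior sampling in a simple Gaussian model. It is an illustration under stated assumptions, not the paper's HierTS implementation: the abstract does not specify a model, so we assume each arm has a scalar hyper-parameter drawn from N(mu_q, var_q), each task's arm mean is drawn from N(hyper-parameter, var_0), and rewards carry Gaussian noise with variance var_noise. All function and parameter names are hypothetical.

```python
import math
import random


def hyper_posterior(mu_q, var_q, var_0, var_noise, counts, sums):
    """Posterior mean and variance of one arm's hyper-parameter.

    counts[s] and sums[s] are the number of pulls and the total reward
    of this arm in task s. Assumed model: the average reward of a task
    with n pulls is Gaussian around the hyper-parameter with variance
    var_0 + var_noise / n.
    """
    precision = 1.0 / var_q
    weighted = mu_q / var_q
    for n, b in zip(counts, sums):
        if n > 0:
            denom = n * var_0 + var_noise
            precision += n / denom
            weighted += b / denom
    return weighted / precision, 1.0 / precision


def task_posterior(mu, var_0, var_noise, n, b):
    """Posterior of a task's arm mean, conditioned on a sampled hyper-parameter mu."""
    precision = 1.0 / var_0 + n / var_noise
    mean = (mu / var_0 + b / var_noise) / precision
    return mean, 1.0 / precision


def choose_arm(task, counts, sums, mu_q, var_q, var_0, var_noise, rng):
    """Two-stage sampling: hyper-parameter first, then the task parameter.

    counts[i][s] and sums[i][s] hold the statistics of arm i in task s.
    Returns the arm whose sampled task-level mean is largest.
    """
    best_arm, best_value = 0, -math.inf
    for i in range(len(counts)):
        h_mean, h_var = hyper_posterior(mu_q, var_q, var_0, var_noise,
                                        counts[i], sums[i])
        mu = rng.gauss(h_mean, math.sqrt(h_var))
        t_mean, t_var = task_posterior(mu, var_0, var_noise,
                                       counts[i][task], sums[i][task])
        theta = rng.gauss(t_mean, math.sqrt(t_var))
        if theta > best_value:
            best_arm, best_value = i, theta
    return best_arm


if __name__ == "__main__":
    rng = random.Random(0)
    # Two arms, three tasks; only arm 0 has been pulled, in tasks 0 and 1.
    counts = [[2, 1, 0], [0, 0, 0]]
    sums = [[1.5, 0.7, 0.0], [0.0, 0.0, 0.0]]
    print(choose_arm(2, counts, sums, 0.0, 1.0, 0.25, 1.0, rng))
```

In this sketch, observations from every task sharpen the hyper-posterior, so a task with no pulls of its own (task 2 above) still samples from a posterior informed by the other tasks. That is the kind of knowledge sharing the abstract attributes to the hierarchy.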

Comments:	Proceedings of the 25th International Conference on Artificial Intelligence and Statistics
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2111.06929 [cs.LG]
	(or arXiv:2111.06929v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2111.06929

Submission history

From: Branislav Kveton
[v1] Fri, 12 Nov 2021 20:33:09 UTC (2,774 KB)
[v2] Sat, 5 Mar 2022 06:27:44 UTC (2,741 KB)
