Copeland Dueling Bandits

Zoghi, Masrour; Karnin, Zohar; Whiteson, Shimon; de Rijke, Maarten

Computer Science > Machine Learning

arXiv:1506.00312 (cs)

[Submitted on 1 Jun 2015]

Title:Copeland Dueling Bandits

Authors:Masrour Zoghi, Zohar Karnin, Shimon Whiteson, Maarten de Rijke

View PDF

Abstract:A version of the dueling bandit problem is addressed in which a Condorcet winner may not exist. Two algorithms are proposed that instead seek to minimize regret with respect to the Copeland winner, which, unlike the Condorcet winner, is guaranteed to exist. The first, Copeland Confidence Bound (CCB), is designed for small numbers of arms, while the second, Scalable Copeland Bandits (SCB), works better for large-scale problems. We provide theoretical results bounding the regret accumulated by CCB and SCB, both substantially improving existing results. Such existing results either offer bounds of the form $O(K \log T)$ but require restrictive assumptions, or offer bounds of the form $O(K^2 \log T)$ without requiring such assumptions. Our results offer the best of both worlds: $O(K \log T)$ bounds without restrictive assumptions.

Comments:	33 pages, 8 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1506.00312 [cs.LG]
	(or arXiv:1506.00312v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1506.00312

Submission history

From: Masrour Zoghi [view email]
[v1] Mon, 1 Jun 2015 00:44:37 UTC (2,140 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2015-06

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Masrour Zoghi
Zohar Shay Karnin
Shimon Whiteson
Maarten de Rijke

export BibTeX citation

Computer Science > Machine Learning

Title:Copeland Dueling Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Copeland Dueling Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators