YAHPO Gym -- Design Criteria and a new Multifidelity Benchmark for Hyperparameter Optimization

Pfisterer, Florian; Schneider, Lennart; Moosbauer, Julia; Binder, Martin; Bischl, Bernd

Computer Science > Machine Learning

arXiv:2109.03670v2 (cs)

[Submitted on 8 Sep 2021 (v1), revised 4 Oct 2021 (this version, v2), latest version 30 Jul 2022 (v4)]

Title:YAHPO Gym -- Design Criteria and a new Multifidelity Benchmark for Hyperparameter Optimization

Authors:Florian Pfisterer, Lennart Schneider, Julia Moosbauer, Martin Binder, Bernd Bischl

View PDF

Abstract:When developing and analyzing new hyperparameter optimization (HPO) methods, it is vital to empirically evaluate and compare them on well-curated benchmark suites. In this work, we list desirable properties and requirements for such benchmarks and propose a new set of challenging and relevant multifidelity HPO benchmark problems motivated by these requirements. For this, we revisit the concept of surrogate-based benchmarks and empirically compare them to more widely-used tabular benchmarks, showing that the latter ones may induce bias in performance estimation and ranking of HPO methods. We present a new surrogate-based benchmark suite for multifidelity HPO methods consisting of 9 benchmark collections that constitute over 700 multifidelity HPO problems in total. All our benchmarks also allow for querying of multiple optimization targets, enabling the benchmarking of multi-objective HPO. We examine and compare our benchmark suite with respect to the defined requirements and show that our benchmarks provide viable additions to existing suites.

Comments:	Preprint. Under review. 17 pages, 4 tables, 5 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2109.03670 [cs.LG]
	(or arXiv:2109.03670v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.03670

Submission history

From: Lennart Schneider [view email]
[v1] Wed, 8 Sep 2021 14:16:31 UTC (1,171 KB)
[v2] Mon, 4 Oct 2021 09:41:20 UTC (545 KB)
[v3] Tue, 5 Apr 2022 14:49:43 UTC (5,432 KB)
[v4] Sat, 30 Jul 2022 12:33:47 UTC (9,019 KB)

Computer Science > Machine Learning

Title:YAHPO Gym -- Design Criteria and a new Multifidelity Benchmark for Hyperparameter Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:YAHPO Gym -- Design Criteria and a new Multifidelity Benchmark for Hyperparameter Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators