Multi-Fidelity Active Learning with GFlowNets

Hernandez-Garcia, Alex; Saxena, Nikita; Jain, Moksh; Liu, Cheng-Hao; Bengio, Yoshua

Computer Science > Machine Learning

arXiv:2306.11715 (cs)

[Submitted on 20 Jun 2023 (v1), last revised 1 Sep 2024 (this version, v2)]

Title:Multi-Fidelity Active Learning with GFlowNets

Authors:Alex Hernandez-Garcia, Nikita Saxena, Moksh Jain, Cheng-Hao Liu, Yoshua Bengio

View PDF HTML (experimental)

Abstract:In the last decades, the capacity to generate large amounts of data in science and engineering applications has been growing steadily. Meanwhile, machine learning has progressed to become a suitable tool to process and utilise the available data. Nonetheless, many relevant scientific and engineering problems present challenges where current machine learning methods cannot yet efficiently leverage the available data and resources. For example, in scientific discovery, we are often faced with the problem of exploring very large, structured and high-dimensional spaces. Moreover, the high fidelity, black-box objective function is often very expensive to evaluate. Progress in machine learning methods that can efficiently tackle such challenges would help accelerate currently crucial areas such as drug and materials discovery. In this paper, we propose a multi-fidelity active learning algorithm with GFlowNets as a sampler, to efficiently discover diverse, high-scoring candidates where multiple approximations of the black-box function are available at lower fidelity and cost. Our evaluation on molecular discovery tasks shows that multi-fidelity active learning with GFlowNets can discover high-scoring candidates at a fraction of the budget of its single-fidelity counterpart while maintaining diversity, unlike RL-based alternatives. These results open new avenues for multi-fidelity active learning to accelerate scientific discovery and engineering design.

Comments:	Published in Transactions on Machine Learning Research (TMLR) 07/2024 this https URL
Subjects:	Machine Learning (cs.LG); Biomolecules (q-bio.BM)
Cite as:	arXiv:2306.11715 [cs.LG]
	(or arXiv:2306.11715v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.11715
Journal reference:	Transactions on Machine Learning Research (TMLR) 07/2024 https://openreview.net/forum?id=dLaazW9zuF

Submission history

From: Alex Hernandez-Garcia [view email]
[v1] Tue, 20 Jun 2023 17:43:42 UTC (233 KB)
[v2] Sun, 1 Sep 2024 11:15:16 UTC (419 KB)

Computer Science > Machine Learning

Title:Multi-Fidelity Active Learning with GFlowNets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Multi-Fidelity Active Learning with GFlowNets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators