Making Scalable Meta Learning Practical

Choe, Sang Keun; Mehta, Sanket Vaibhav; Ahn, Hwijeen; Neiswanger, Willie; Xie, Pengtao; Strubell, Emma; Xing, Eric

Computer Science > Machine Learning

arXiv:2310.05674 (cs)

[Submitted on 9 Oct 2023 (v1), last revised 23 Oct 2023 (this version, v2)]

Title:Making Scalable Meta Learning Practical

Authors:Sang Keun Choe, Sanket Vaibhav Mehta, Hwijeen Ahn, Willie Neiswanger, Pengtao Xie, Emma Strubell, Eric Xing

View PDF

Abstract:Despite its flexibility to learn diverse inductive biases in machine learning programs, meta learning (i.e., learning to learn) has long been recognized to suffer from poor scalability due to its tremendous compute/memory costs, training instability, and a lack of efficient distributed training support. In this work, we focus on making scalable meta learning practical by introducing SAMA, which combines advances in both implicit differentiation algorithms and systems. Specifically, SAMA is designed to flexibly support a broad range of adaptive optimizers in the base level of meta learning programs, while reducing computational burden by avoiding explicit computation of second-order gradient information, and exploiting efficient distributed training techniques implemented for first-order gradients. Evaluated on multiple large-scale meta learning benchmarks, SAMA showcases up to 1.7/4.8x increase in throughput and 2.0/3.8x decrease in memory consumption respectively on single-/multi-GPU setups compared to other baseline meta learning algorithms. Furthermore, we show that SAMA-based data optimization leads to consistent improvements in text classification accuracy with BERT and RoBERTa large language models, and achieves state-of-the-art results in both small- and large-scale data pruning on image classification tasks, demonstrating the practical applicability of scalable meta learning across language and vision domains.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2310.05674 [cs.LG]
	(or arXiv:2310.05674v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.05674

Submission history

From: Sang Keun Choe [view email]
[v1] Mon, 9 Oct 2023 12:45:13 UTC (193 KB)
[v2] Mon, 23 Oct 2023 14:16:49 UTC (193 KB)

Computer Science > Machine Learning

Title:Making Scalable Meta Learning Practical

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Making Scalable Meta Learning Practical

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators