Uncovering mesa-optimization algorithms in Transformers

von Oswald, Johannes; Schlegel, Maximilian; Meulemans, Alexander; Kobayashi, Seijin; Niklasson, Eyvind; Zucchet, Nicolas; Scherrer, Nino; Miller, Nolan; Sandler, Mark; Arcas, Blaise Agüera y; Vladymyrov, Max; Pascanu, Razvan; Sacramento, João

Computer Science > Machine Learning

arXiv:2309.05858 (cs)

[Submitted on 11 Sep 2023 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:Uncovering mesa-optimization algorithms in Transformers

Authors:Johannes von Oswald, Maximilian Schlegel, Alexander Meulemans, Seijin Kobayashi, Eyvind Niklasson, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

View PDF HTML (experimental)

Abstract:Some autoregressive models exhibit in-context learning capabilities: being able to learn as an input sequence is processed, without undergoing any parameter changes, and without being explicitly trained to do so. The origins of this phenomenon are still poorly understood. Here we analyze a series of Transformer models trained to perform synthetic sequence prediction tasks, and discover that standard next-token prediction error minimization gives rise to a subsidiary learning algorithm that adjusts the model as new inputs are revealed. We show that this process corresponds to gradient-based optimization of a principled objective function, which leads to strong generalization performance on unseen sequences. Our findings explain in-context learning as a product of autoregressive loss minimization and inform the design of new optimization-based Transformer layers.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.05858 [cs.LG]
	(or arXiv:2309.05858v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.05858

Submission history

From: Johannes Von Oswald [view email]
[v1] Mon, 11 Sep 2023 22:42:50 UTC (4,051 KB)
[v2] Tue, 15 Oct 2024 13:43:50 UTC (1,567 KB)

Computer Science > Machine Learning

Title:Uncovering mesa-optimization algorithms in Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Uncovering mesa-optimization algorithms in Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators