Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

Xie, Sang Michael; Ma, Tengyu; Liang, Percy

Computer Science > Machine Learning

arXiv:2006.16205v4 (cs)

[Submitted on 29 Jun 2020 (v1), last revised 24 Oct 2023 (this version, v4)]

Title:Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

Authors:Sang Michael Xie, Tengyu Ma, Percy Liang

View PDF

Abstract:We focus on prediction problems with structured outputs that are subject to output validity constraints, e.g. pseudocode-to-code translation where the code must compile. While labeled input-output pairs are expensive to obtain, "unlabeled" outputs, i.e. outputs without corresponding inputs, are freely available (e.g. code on GitHub) and provide information about output validity. We can capture the output structure by pre-training a denoiser to denoise corrupted versions of unlabeled outputs. We first show that standard fine-tuning after pre-training destroys some of this structure. We then propose composed fine-tuning, which fine-tunes a predictor composed with the pre-trained denoiser, which is frozen to preserve output structure. For two-layer ReLU networks, we prove that composed fine-tuning significantly reduces the complexity of the predictor, thus improving generalization. Empirically, we show that composed fine-tuning improves over standard fine-tuning on two pseudocode-to-code translation datasets (3% and 6% relative). The improvement from composed fine-tuning is magnified on out-of-distribution (OOD) examples (4% and 25% relative).

Comments:	ICML 2021 Long talk
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2006.16205 [cs.LG]
	(or arXiv:2006.16205v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.16205

Submission history

From: Sang Michael Xie [view email]
[v1] Mon, 29 Jun 2020 17:14:35 UTC (739 KB)
[v2] Tue, 10 Nov 2020 06:36:18 UTC (875 KB)
[v3] Fri, 11 Jun 2021 23:45:47 UTC (3,005 KB)
[v4] Tue, 24 Oct 2023 23:44:37 UTC (6,005 KB)

Computer Science > Machine Learning

Title:Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators