Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation?

He, Tianxing; Zhang, Jingzhao; Zhou, Zhiming; Glass, James

arXiv:1905.10617 (cs)

[Submitted on 25 May 2019 (v1), last revised 3 Sep 2021 (this version, v10)]

Abstract: Exposure bias has been regarded as a central problem for auto-regressive language models (LMs). The claim is that teacher forcing causes test-time generation to be incrementally distorted because of the training-generation discrepancy. Although many algorithms have been proposed to avoid teacher forcing and thereby alleviate exposure bias, little work shows how serious the exposure bias problem actually is. In this work, we focus on the task of open-ended language generation and propose metrics to quantify the impact of exposure bias in terms of quality, diversity, and consistency. Our key intuition is that if we feed ground-truth data prefixes (instead of prefixes generated by the model itself) into the model and ask it to continue the generation, the performance should become much better because the training-generation discrepancy in the prefix is removed. Both automatic and human evaluations are conducted in our experiments. Contrary to the popular belief in exposure bias, we find that the distortion induced by the prefix discrepancy is limited and does not seem to be incremental during generation. Moreover, our analysis reveals an interesting self-recovery ability of the LM, which we hypothesize counters the harmful effects of exposure bias.
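
The prefix comparison described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: `compare_prefix_sources`, `generate`, and `score` are hypothetical names, `generate(prefix, n)` is assumed to return only the `n` newly generated tokens, and `score` stands in for whatever quality, diversity, or consistency metric is being measured.

```python
from statistics import mean
from typing import Callable, Dict, List, Sequence

Tokens = List[str]


def compare_prefix_sources(
    generate: Callable[[Tokens, int], Tokens],  # hypothetical: returns n new tokens
    score: Callable[[Tokens], float],           # hypothetical stand-in metric
    data_prefixes: Sequence[Tokens],
    cont_len: int,
) -> Dict[str, float]:
    """Score continuations conditioned on data prefixes vs. model prefixes."""
    data_scores, model_scores = [], []
    for data_prefix in data_prefixes:
        # Case 1: continue from a ground-truth prefix taken from the data.
        data_scores.append(score(generate(list(data_prefix), cont_len)))

        # Case 2: let the model write its own prefix of the same length
        # (starting from an empty context), then continue from that.
        model_prefix = generate([], len(data_prefix))
        model_scores.append(score(generate(model_prefix, cont_len)))

    data_avg, model_avg = mean(data_scores), mean(model_scores)
    # A large positive gap would suggest the model-generated prefix hurts
    # the continuation; a small gap would suggest limited distortion.
    return {
        "data_prefix": data_avg,
        "model_prefix": model_avg,
        "gap": data_avg - model_avg,
    }
```

Under these assumptions, repeating the comparison for several prefix lengths would indicate whether the gap grows with the amount of model-generated context, which is what an incremental distortion would predict.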

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:1905.10617 [cs.LG]
	(or arXiv:1905.10617v10 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.10617
Journal reference:	EMNLP 2021

Submission history

From: Tianxing He
[v1] Sat, 25 May 2019 15:34:43 UTC (539 KB)
[v2] Mon, 19 Aug 2019 04:36:54 UTC (641 KB)
[v3] Mon, 2 Dec 2019 16:00:27 UTC (1,114 KB)
[v4] Sat, 8 Feb 2020 19:18:44 UTC (1,618 KB)
[v5] Fri, 17 Apr 2020 15:59:05 UTC (1,582 KB)
[v6] Thu, 24 Dec 2020 01:11:21 UTC (1 KB) (withdrawn)
[v7] Thu, 31 Dec 2020 03:12:32 UTC (1,802 KB)
[v8] Wed, 31 Mar 2021 00:38:34 UTC (1,997 KB)
[v9] Sun, 29 Aug 2021 19:33:12 UTC (2,011 KB)
[v10] Fri, 3 Sep 2021 01:21:17 UTC (2,014 KB)
