Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification

Choshen, Leshem; Abend, Omri

Computer Science > Computation and Language

arXiv:1804.11254 (cs)

[Submitted on 30 Apr 2018 (v1), last revised 18 Sep 2019 (this version, v3)]

Title:Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification

Authors:Leshem Choshen, Omri Abend

View PDF

Abstract:The prevalent use of too few references for evaluating text-to-text generation is known to bias estimates of their quality ({\it low coverage bias} or LCB). This paper shows that overcoming LCB in Grammatical Error Correction (GEC) evaluation cannot be attained by re-scaling or by increasing the number of references in any feasible range, contrary to previous suggestions. This is due to the long-tailed distribution of valid corrections for a sentence. Concretely, we show that LCB incentivizes GEC systems to avoid correcting even when they can generate a valid correction. Consequently, existing systems obtain comparable or superior performance compared to humans, by making few but targeted changes to the input. Similar effects on Text Simplification further support our claims.

Comments:	Accepted to ACL 2018 (figures currently omitted due to technical arxiv issues
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1804.11254 [cs.CL]
	(or arXiv:1804.11254v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1804.11254

Submission history

From: Leshem Choshen [view email]
[v1] Mon, 30 Apr 2018 14:59:56 UTC (377 KB)
[v2] Tue, 1 May 2018 17:45:17 UTC (821 KB)
[v3] Wed, 18 Sep 2019 08:30:56 UTC (821 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-04

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Leshem Choshen
Omri Abend

export BibTeX citation

Computer Science > Computation and Language

Title:Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators