Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Berger, Nathaniel; Exel, Miriam; Huck, Matthias; Riezler, Stefan

Computer Science > Computation and Language

arXiv:2307.08416 (cs)

[Submitted on 17 Jul 2023]

Title:Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Authors:Nathaniel Berger, Miriam Exel, Matthias Huck, Stefan Riezler

View PDF

Abstract:Supervised learning in Neural Machine Translation (NMT) typically follows a teacher forcing paradigm where reference tokens constitute the conditioning context in the model's prediction, instead of its own previous predictions. In order to alleviate this lack of exploration in the space of translations, we present a simple extension of standard maximum likelihood estimation by a contrastive marking objective. The additional training signals are extracted automatically from reference translations by comparing the system hypothesis against the reference, and used for up/down-weighting correct/incorrect tokens. The proposed new training procedure requires one additional translation pass over the training set per epoch, and does not alter the standard inference setup. We show that training with contrastive markings yields improvements on top of supervised learning, and is especially useful when learning from postedits where contrastive markings indicate human error corrections to the original hypotheses. Code is publicly released.

Comments:	Proceedings of the 24th Annual Conference of the European Association for Machine Translation, p. 69-78 Tampere, Finland, June 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2307.08416 [cs.CL]
	(or arXiv:2307.08416v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2307.08416

Submission history

From: Nathaniel Berger [view email]
[v1] Mon, 17 Jul 2023 11:56:32 UTC (85 KB)

Computer Science > Computation and Language

Title:Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Enhancing Supervised Learning with Contrastive Markings in Neural Machine Translation Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators