MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation

Xie, Pan; Li, Zexian; Hu, Xiaohui

Computer Science > Computation and Language

arXiv:2108.08447 (cs)

[Submitted on 19 Aug 2021]

Title:MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation

Authors:Pan Xie, Zexian Li, Xiaohui Hu

View PDF

Abstract:Conditional masked language models (CMLM) have shown impressive progress in non-autoregressive machine translation (NAT). They learn the conditional translation model by predicting the random masked subset in the target sentence. Based on the CMLM framework, we introduce Multi-view Subset Regularization (MvSR), a novel regularization method to improve the performance of the NAT model. Specifically, MvSR consists of two parts: (1) \textit{shared mask consistency}: we forward the same target with different mask strategies, and encourage the predictions of shared mask positions to be consistent with each other. (2) \textit{model consistency}, we maintain an exponential moving average of the model weights, and enforce the predictions to be consistent between the average model and the online model. Without changing the CMLM-based architecture, our approach achieves remarkable performance on three public benchmarks with 0.36-1.14 BLEU gains over previous NAT models. Moreover, compared with the stronger Transformer baseline, we reduce the gap to 0.01-0.44 BLEU scores on small datasets (WMT16 RO$\leftrightarrow$EN and IWSLT DE$\rightarrow$EN).

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2108.08447 [cs.CL]
	(or arXiv:2108.08447v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2108.08447

Submission history

From: Pan Xie [view email]
[v1] Thu, 19 Aug 2021 02:30:38 UTC (4,447 KB)

Computer Science > Computation and Language

Title:MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MvSR-NAT: Multi-view Subset Regularization for Non-Autoregressive Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators