Unsupervised Neural Machine Translation

Artetxe, Mikel; Labaka, Gorka; Agirre, Eneko; Cho, Kyunghyun

Computer Science > Computation and Language

arXiv:1710.11041 (cs)

[Submitted on 30 Oct 2017 (v1), last revised 26 Feb 2018 (this version, v2)]

Title:Unsupervised Neural Machine Translation

Authors:Mikel Artetxe, Gorka Labaka, Eneko Agirre, Kyunghyun Cho

View PDF

Abstract:In spite of the recent success of neural machine translation (NMT) in standard benchmarks, the lack of large parallel corpora poses a major practical problem for many language pairs. There have been several proposals to alleviate this issue with, for instance, triangulation and semi-supervised learning techniques, but they still require a strong cross-lingual signal. In this work, we completely remove the need of parallel data and propose a novel method to train an NMT system in a completely unsupervised manner, relying on nothing but monolingual corpora. Our model builds upon the recent work on unsupervised embedding mappings, and consists of a slightly modified attentional encoder-decoder model that can be trained on monolingual corpora alone using a combination of denoising and backtranslation. Despite the simplicity of the approach, our system obtains 15.56 and 10.21 BLEU points in WMT 2014 French-to-English and German-to-English translation. The model can also profit from small parallel corpora, and attains 21.81 and 15.24 points when combined with 100,000 parallel sentences, respectively. Our implementation is released as an open source project.

Comments:	Published as a conference paper at ICLR 2018
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1710.11041 [cs.CL]
	(or arXiv:1710.11041v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1710.11041

Submission history

From: Mikel Artetxe [view email]
[v1] Mon, 30 Oct 2017 16:17:34 UTC (155 KB)
[v2] Mon, 26 Feb 2018 16:54:14 UTC (151 KB)

Computer Science > Computation and Language

Title:Unsupervised Neural Machine Translation

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Unsupervised Neural Machine Translation

Submission history

Access Paper:

References & Citations

2 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators