Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

Wang, Zirui; Xie, Jiateng; Xu, Ruochen; Yang, Yiming; Neubig, Graham; Carbonell, Jaime

arXiv:1910.04708 (cs)

[Submitted on 10 Oct 2019 (v1), last revised 18 Feb 2020 (this version, v4)]

Abstract: Learning multilingual representations of text has proven a successful method for many cross-lingual transfer learning tasks. There are two main paradigms for learning such representations: (1) alignment, which maps different independently trained monolingual representations into a shared space, and (2) joint training, which directly learns unified multilingual representations using monolingual and cross-lingual objectives jointly. In this paper, we first conduct direct comparisons of representations learned using both of these methods across diverse cross-lingual tasks. Our empirical results reveal a set of pros and cons for both methods, and show that the relative performance of alignment versus joint training is task-dependent. Stemming from this analysis, we propose a simple and novel framework that combines these two previously mutually-exclusive approaches. Extensive experiments demonstrate that our proposed framework alleviates limitations of both approaches, and outperforms existing methods on the MUSE bilingual lexicon induction (BLI) benchmark. We further show that this framework can generalize to contextualized representations such as Multilingual BERT, and produces state-of-the-art results on the CoNLL cross-lingual NER benchmark.
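
As a rough illustration of the alignment paradigm named in the abstract (mapping one independently trained embedding space onto another), the toy sketch below fits a single 2-D rotation from a few paired vectors. Everything in it is an assumption made for illustration: the abstract does not specify how alignment is performed, the vectors are made-up numbers rather than real embeddings, and the helper names `fit_rotation` and `rotate` are hypothetical. Real systems work in hundreds of dimensions and typically solve the analogous orthogonal mapping problem with a matrix decomposition.

```python
import math

def rotate(v, theta):
    """Rotate a 2-D vector counter-clockwise by theta radians."""
    c, s = math.cos(theta), math.sin(theta)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])

def fit_rotation(src, tgt):
    """Angle of the rotation R that minimizes sum ||R x - y||^2 over pairs (x, y).

    Minimizing the squared error is the same as maximizing sum (R x) . y,
    which equals cos(theta) * dot + sin(theta) * cross for the sums below.
    """
    cross = sum(x[0] * y[1] - x[1] * y[0] for x, y in zip(src, tgt))
    dot = sum(x[0] * y[0] + x[1] * y[1] for x, y in zip(src, tgt))
    return math.atan2(cross, dot)

# Hypothetical "target language" vectors for three paired words (toy data).
tgt = [(1.0, 0.0), (0.0, 1.0), (0.6, 0.8)]

# Hypothetical "source language" vectors: the same geometry, rotated by an
# angle that the fitting step is not told about.
hidden_angle = 0.7
src = [rotate(v, -hidden_angle) for v in tgt]

# Alignment step: learn the map from the paired vectors, then apply it.
theta = fit_rotation(src, tgt)
aligned = [rotate(v, theta) for v in src]
```

Because the toy source space is an exact rotation of the target space, the fitted angle recovers the hidden one and `aligned` coincides with `tgt` up to floating-point error. With real embeddings the two spaces are only approximately related by such a map, so the fit is a least-squares compromise.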

Comments:	Published as a conference paper at ICLR 2020. First two authors contributed equally. Source code is available at this https URL
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1910.04708 [cs.CL]
	(or arXiv:1910.04708v4 [cs.CL] for this version)
 	https://doi.org/10.48550/arXiv.1910.04708

Submission history

From: Zirui Wang
[v1] Thu, 10 Oct 2019 17:04:30 UTC (4,485 KB)
[v2] Sun, 13 Oct 2019 16:34:25 UTC (4,485 KB)
[v3] Sat, 1 Feb 2020 20:48:45 UTC (4,485 KB)
[v4] Tue, 18 Feb 2020 00:59:03 UTC (4,485 KB)
