As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages

de Vries, Wietse; Nissim, Malvina

doi:10.18653/v1/2021.findings-acl.74

Computer Science > Computation and Language

arXiv:2012.05628 (cs)

[Submitted on 10 Dec 2020 (v1), last revised 9 Jun 2021 (this version, v3)]

Title:As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages

Authors:Wietse de Vries, Malvina Nissim

View PDF

Abstract:Large generative language models have been very successful for English, but other languages lag behind, in part due to data and computational limitations. We propose a method that may overcome these problems by adapting existing pre-trained models to new languages. Specifically, we describe the adaptation of English GPT-2 to Italian and Dutch by retraining lexical embeddings without tuning the Transformer layers. As a result, we obtain lexical embeddings for Italian and Dutch that are aligned with the original English lexical embeddings. Additionally, we scale up complexity by transforming relearned lexical embeddings of GPT-2 small to the GPT-2 medium embedding space. This method minimises the amount of training and prevents losing information during adaptation that was learned by GPT-2. English GPT-2 models with relearned lexical embeddings can generate realistic sentences in Italian and Dutch. Though on average these sentences are still identifiable as artificial by humans, they are assessed on par with sentences generated by a GPT-2 model fully trained from scratch.

Comments:	Findings of ACL 2021 Camera Ready
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.05628 [cs.CL]
	(or arXiv:2012.05628v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.05628
Journal reference:	Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021
Related DOI:	https://doi.org/10.18653/v1/2021.findings-acl.74

Submission history

From: Wietse de Vries [view email]
[v1] Thu, 10 Dec 2020 12:27:16 UTC (121 KB)
[v2] Sat, 22 May 2021 09:21:35 UTC (130 KB)
[v3] Wed, 9 Jun 2021 07:57:32 UTC (130 KB)

Computer Science > Computation and Language

Title:As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:As Good as New. How to Successfully Recycle English GPT-2 to Make Models for Other Languages

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators