Towards Effective Disambiguation for Machine Translation with Large Language Models

Iyer, Vivek; Chen, Pinzhen; Birch, Alexandra

Computer Science > Computation and Language

arXiv:2309.11668 (cs)

[Submitted on 20 Sep 2023 (v1), last revised 21 Oct 2023 (this version, v2)]

Title:Towards Effective Disambiguation for Machine Translation with Large Language Models

Authors:Vivek Iyer, Pinzhen Chen, Alexandra Birch

View PDF

Abstract:Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation. Recent work on benchmarking translation performance on ambiguous sentences has exposed the limitations of conventional Neural Machine Translation (NMT) systems, which fail to handle many such cases. Large language models (LLMs) have emerged as a promising alternative, demonstrating comparable performance to traditional NMT models while introducing new paradigms for controlling the target outputs. In this paper, we study the capabilities of LLMs to translate "ambiguous sentences" - i.e. those containing highly polysemous words and/or rare word senses. We also propose two ways to improve their disambiguation capabilities, through a) in-context learning and b) fine-tuning on carefully curated ambiguous datasets. Experiments show that our methods can match or outperform state-of-the-art systems such as DeepL and NLLB in four out of five language directions. Our research provides valuable insights into effectively adapting LLMs to become better disambiguators during Machine Translation. We release our curated disambiguation corpora and resources at this https URL.

Comments:	WMT 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.11668 [cs.CL]
	(or arXiv:2309.11668v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.11668

Submission history

From: Vivek Iyer [view email]
[v1] Wed, 20 Sep 2023 22:22:52 UTC (190 KB)
[v2] Sat, 21 Oct 2023 16:02:07 UTC (191 KB)

Computer Science > Computation and Language

Title:Towards Effective Disambiguation for Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Towards Effective Disambiguation for Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators