Pitfalls and Outlooks in Using COMET

Zouhar, Vilém; Chen, Pinzhen; Lam, Tsz Kin; Moghe, Nikita; Haddow, Barry

Computer Science > Computation and Language

arXiv:2408.15366 (cs)

[Submitted on 27 Aug 2024 (v1), last revised 30 Sep 2024 (this version, v3)]

Title:Pitfalls and Outlooks in Using COMET

Authors:Vilém Zouhar, Pinzhen Chen, Tsz Kin Lam, Nikita Moghe, Barry Haddow

View PDF HTML (experimental)

Abstract:The COMET metric has blazed a trail in the machine translation community, given its strong correlation with human judgements of translation quality. Its success stems from being a modified pre-trained multilingual model finetuned for quality assessment. However, it being a machine learning model also gives rise to a new set of pitfalls that may not be widely known. We investigate these unexpected behaviours from three aspects: 1) technical: obsolete software versions and compute precision; 2) data: empty content, language mismatch, and translationese at test time as well as distribution and domain biases in training; 3) usage and reporting: multi-reference support and model referencing in the literature. All of these problems imply that COMET scores are not comparable between papers or even technical setups and we put forward our perspective on fixing each issue. Furthermore, we release the sacreCOMET package that can generate a signature for the software and model configuration as well as an appropriate citation. The goal of this work is to help the community make more sound use of the COMET metric.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2408.15366 [cs.CL]
	(or arXiv:2408.15366v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.15366

Submission history

From: Vilém Zouhar [view email]
[v1] Tue, 27 Aug 2024 19:03:11 UTC (53 KB)
[v2] Mon, 2 Sep 2024 08:18:52 UTC (53 KB)
[v3] Mon, 30 Sep 2024 13:44:11 UTC (76 KB)

Computer Science > Computation and Language

Title:Pitfalls and Outlooks in Using COMET

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Pitfalls and Outlooks in Using COMET

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators