On Length Divergence Bias in Textual Matching Models

Jiang, Lan; Lyu, Tianshu; Lin, Yankai; Chong, Meng; Lyu, Xiaoyong; Yin, Dawei

Computer Science > Computation and Language

arXiv:2109.02431 (cs)

[Submitted on 6 Sep 2021 (v1), last revised 4 May 2022 (this version, v3)]

Title:On Length Divergence Bias in Textual Matching Models

Authors:Lan Jiang, Tianshu Lyu, Yankai Lin, Meng Chong, Xiaoyong Lyu, Dawei Yin

View PDF

Abstract:Despite the remarkable success deep models have achieved in Textual Matching (TM) tasks, it still remains unclear whether they truly understand language or measure the semantic similarity of texts by exploiting statistical bias in datasets. In this work, we provide a new perspective to study this issue -- via the length divergence bias. We find the length divergence heuristic widely exists in prevalent TM datasets, providing direct cues for prediction. To determine whether TM models have adopted such heuristic, we introduce an adversarial evaluation scheme which invalidates the heuristic. In this adversarial setting, all TM models perform worse, indicating they have indeed adopted this heuristic. Through a well-designed probing experiment, we empirically validate that the bias of TM models can be attributed in part to extracting the text length information during training. To alleviate the length divergence bias, we propose an adversarial training method. The results demonstrate we successfully improve the robustness and generalization ability of models at the same time.

Comments:	Accepted to Findings of ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.02431 [cs.CL]
	(or arXiv:2109.02431v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.02431

Submission history

From: Lan Jiang [view email]
[v1] Mon, 6 Sep 2021 13:12:06 UTC (53 KB)
[v2] Mon, 25 Oct 2021 07:28:23 UTC (1 KB) (withdrawn)
[v3] Wed, 4 May 2022 04:40:14 UTC (107 KB)

Computer Science > Computation and Language

Title:On Length Divergence Bias in Textual Matching Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On Length Divergence Bias in Textual Matching Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators