A Pragmatics-Centered Evaluation Framework for Natural Language Understanding

Sileo, Damien; Van-de-Cruys, Tim; Pradel, Camille; Muller, Philippe

Computer Science > Computation and Language

arXiv:1907.08672 (cs)

[Submitted on 19 Jul 2019 (v1), last revised 4 Apr 2022 (this version, v2)]

Title:A Pragmatics-Centered Evaluation Framework for Natural Language Understanding

Authors:Damien Sileo, Tim Van-de-Cruys, Camille Pradel, Philippe Muller

View PDF

Abstract:New models for natural language understanding have recently made an unparalleled amount of progress, which has led some researchers to suggest that the models induce universal text representations. However, current benchmarks are predominantly targeting semantic phenomena; we make the case that pragmatics needs to take center stage in the evaluation of natural language understanding. We introduce PragmEval, a new benchmark for the evaluation of natural language understanding, that unites 11 pragmatics-focused evaluation datasets for English. PragmEval can be used as supplementary training data in a multi-task learning setup, and is publicly available, alongside the code for gathering and preprocessing the datasets. Using our evaluation suite, we show that natural language inference, a widely used pretraining task, does not result in genuinely universal representations, which presents a new challenge for multi-task learning.

Comments:	Accepted at LREC2022
Subjects:	Computation and Language (cs.CL)
ACM classes:	I.2.7; I.2.6
Cite as:	arXiv:1907.08672 [cs.CL]
	(or arXiv:1907.08672v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1907.08672

Submission history

From: Damien Sileo [view email]
[v1] Fri, 19 Jul 2019 20:09:03 UTC (42 KB)
[v2] Mon, 4 Apr 2022 13:38:21 UTC (189 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2019-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Damien Sileo
Tim Van de Cruys
Camille Pradel
Philippe Muller

export BibTeX citation

Computer Science > Computation and Language

Title:A Pragmatics-Centered Evaluation Framework for Natural Language Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Pragmatics-Centered Evaluation Framework for Natural Language Understanding

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators