From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Yan, Lingyong; Han, Xianpei; Sun, Le; Liu, Fangchao; Bian, Ning

Computer Science > Computation and Language

arXiv:2012.04334 (cs)

[Submitted on 8 Dec 2020 (v1), last revised 9 Dec 2020 (this version, v2)]

Title:From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Authors:Lingyong Yan, Xianpei Han, Le Sun, Fangchao Liu, Ning Bian

View PDF

Abstract:Distant supervision (DS) is a promising approach for relation extraction but often suffers from the noisy label problem. Traditional DS methods usually represent an entity pair as a bag of sentences and denoise labels using multi-instance learning techniques. The bag-based paradigm, however, fails to leverage the inter-sentence-level and the entity-level evidence for relation extraction, and their denoising algorithms are often specialized and complicated. In this paper, we propose a new DS paradigm--document-based distant supervision, which models relation extraction as a document-based machine reading comprehension (MRC) task. By re-organizing all sentences about an entity as a document and extracting relations via querying the document with relation-specific questions, the document-based DS paradigm can simultaneously encode and exploit all sentence-level, inter-sentence-level, and entity-level evidence. Furthermore, we design a new loss function--DSLoss (distant supervision loss), which can effectively train MRC models using only $\langle$document, question, answer$\rangle$ tuples, therefore noisy label problem can be inherently resolved. Experiments show that our method achieves new state-of-the-art DS performance.

Comments:	12 pages, 3 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2012.04334 [cs.CL]
	(or arXiv:2012.04334v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.04334

Submission history

From: Lingyong Yan [view email]
[v1] Tue, 8 Dec 2020 10:16:27 UTC (276 KB)
[v2] Wed, 9 Dec 2020 03:05:41 UTC (276 KB)

Computer Science > Computation and Language

Title:From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:From Bag of Sentences to Document: Distantly Supervised Relation Extraction via Machine Reading Comprehension

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators