Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts

Luu, Son T.; Bui, Mao Nguyen; Nguyen, Loi Duc; Tran, Khiem Vinh; Van Nguyen, Kiet; Nguyen, Ngan Luu-Thuy

doi:10.1007/978-3-030-88113-9_44

arXiv:2105.01542 (cs)

[Submitted on 4 May 2021 (v1), last revised 30 Sep 2021 (this version, v6)]

Abstract: Machine reading comprehension (MRC) is a sub-field of natural language processing that aims to help computers understand unstructured texts and then answer questions about them. In practice, conversation is an essential way to communicate and transfer information. To help machines understand conversational texts, we present UIT-ViCoQA, a new corpus for conversational machine reading comprehension in the Vietnamese language. The corpus consists of 10,000 questions with answers across 2,000 conversations about health news articles. We then evaluate several baseline approaches for conversational machine comprehension on the UIT-ViCoQA corpus. The best model obtains an F1 score of 45.27%, which is 30.91 points behind human performance (76.18%), indicating that there is ample room for improvement. Our dataset is available for research purposes at our website: this http URL.
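For readers unfamiliar with how an F1 score is typically computed for extractive or free-form answers, the following is a minimal sketch of a token-overlap F1 between a predicted answer and a reference answer. It assumes whitespace tokenization and lowercasing, in the style commonly used for reading comprehension benchmarks; the abstract does not specify the exact scoring procedure used for UIT-ViCoQA, and the function name `token_f1` is hypothetical.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer.

    Assumption: simple lowercasing and whitespace tokenization. The actual
    UIT-ViCoQA evaluation may normalize or segment Vietnamese text differently.
    """
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

A corpus-level score would usually average this value over all questions; when several reference answers exist, a common choice is to take the maximum over references for each question. Both choices are assumptions here, not details confirmed by the abstract.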

Comments:	Published at The 13th International Conference on Computational Collective Intelligence (ICCCI 2021)
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2105.01542 [cs.CL]
	(or arXiv:2105.01542v6 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2105.01542
Related DOI:	https://doi.org/10.1007/978-3-030-88113-9_44

Submission history

From: Son T. Luu
[v1] Tue, 4 May 2021 14:50:39 UTC (398 KB)
[v2] Wed, 5 May 2021 01:48:26 UTC (400 KB)
[v3] Sat, 15 May 2021 10:43:18 UTC (398 KB)
[v4] Fri, 21 May 2021 09:47:05 UTC (399 KB)
[v5] Fri, 2 Jul 2021 01:59:01 UTC (445 KB)
[v6] Thu, 30 Sep 2021 14:59:03 UTC (443 KB)
