The Utility of General Domain Transfer Learning for Medical Language Tasks

Ranti, Daniel; Hanss, Katie; Zhao, Shan; Arvind, Varun; Titano, Joseph; Costa, Anthony; Oermann, Eric

Computer Science > Computation and Language

arXiv:2002.06670 (cs)

[Submitted on 16 Feb 2020]

Title:The Utility of General Domain Transfer Learning for Medical Language Tasks

Authors:Daniel Ranti, Katie Hanss, Shan Zhao, Varun Arvind, Joseph Titano, Anthony Costa, Eric Oermann

View PDF

Abstract:The purpose of this study is to analyze the efficacy of transfer learning techniques and transformer-based models as applied to medical natural language processing (NLP) tasks, specifically radiological text classification. We used 1,977 labeled head CT reports, from a corpus of 96,303 total reports, to evaluate the efficacy of pretraining using general domain corpora and a combined general and medical domain corpus with a bidirectional representations from transformers (BERT) model for the purpose of radiological text classification. Model performance was benchmarked to a logistic regression using bag-of-words vectorization and a long short-term memory (LSTM) multi-label multi-class classification model, and compared to the published literature in medical text classification. The BERT models using either set of pretrained checkpoints outperformed the logistic regression model, achieving sample-weighted average F1-scores of 0.87 and 0.87 for the general domain model and the combined general and biomedical-domain model. General text transfer learning may be a viable technique to generate state-of-the-art results within medical NLP tasks on radiological corpora, outperforming other deep models such as LSTMs. The efficacy of pretraining and transformer-based models could serve to facilitate the creation of groundbreaking NLP models in the uniquely challenging data environment of medical text.

Comments:	8 pages, 5 figures, 2 tables
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2002.06670 [cs.CL]
	(or arXiv:2002.06670v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2002.06670

Submission history

From: Daniel Ranti [view email]
[v1] Sun, 16 Feb 2020 20:20:38 UTC (1,850 KB)

Computer Science > Computation and Language

Title:The Utility of General Domain Transfer Learning for Medical Language Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:The Utility of General Domain Transfer Learning for Medical Language Tasks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators