Article
Weakly Labeled Data Augmentation for Deep
Learning: A Study on COVID-19 Detection in
Chest X-Rays
Sivaramakrishnan Rajaraman * and Sameer Antani
Lister Hill National Center for Biomedical Communications, National Library of Medicine, 8600 Rockville Pike,
Bethesda, MD 20894, USA; [email protected]
* Correspondence: [email protected]; Tel.: +1-301-827-2383
Received: 24 April 2020; Accepted: 29 May 2020; Published: 30 May 2020
Abstract: The novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a pandemic resulting, as of this writing, in over 2.7 million infected individuals and over 190,000 deaths, with both counts still growing.
Assertions in the literature suggest that respiratory disorders due to COVID-19 commonly present
with pneumonia-like symptoms which are radiologically confirmed as opacities. Radiology serves
as an adjunct to the reverse transcription-polymerase chain reaction test for confirmation and
evaluating disease progression. While computed tomography (CT) imaging is more specific than
chest X-rays (CXR), its use is limited due to cross-contamination concerns. CXR imaging is commonly
used in high-demand situations, placing a significant burden on radiology services. The use of artificial
intelligence (AI) has been suggested to alleviate this burden. However, there is a dearth of sufficient
training data for developing image-based AI tools. We propose increasing training data for recognizing
COVID-19 pneumonia opacities using weakly labeled data augmentation. This follows from a
hypothesis that the COVID-19 manifestation would be similar to that caused by other viral pathogens
affecting the lungs. We expand the training data distribution for supervised learning through the use
of weakly labeled CXR images, automatically pooled from publicly available pneumonia datasets,
to classify them into those with bacterial or viral pneumonia opacities. Next, we use these selected
images in a stage-wise, strategic approach to train convolutional neural network-based algorithms
and compare against those trained with non-augmented data. Weakly labeled data augmentation
expands the learned feature space in an attempt to encompass variability in unseen test distributions,
enhance inter-class discrimination, and reduce the generalization error. Empirical evaluations
demonstrate that simple weakly labeled data augmentation (Acc: 0.5555 and Acc: 0.6536) is better
than baseline non-augmented training (Acc: 0.2885 and Acc: 0.5028) in identifying COVID-19
manifestations as viral pneumonia. Interestingly, adding COVID-19 CXRs to simple weakly labeled
augmented training data significantly improves the performance (Acc: 0.7095 and Acc: 0.8889),
suggesting that COVID-19, though viral in origin, creates a uniquely different presentation in CXRs
compared with other viral pneumonia manifestations.
Keywords: augmentation; chest X-rays; convolutional neural network; COVID-19; deep learning;
pneumonia; localization
1. Introduction
The novel coronavirus disease 2019 (COVID-19) is caused by a strain of coronavirus called severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) that originated in Wuhan, in the Hubei province of China. On 11 March 2020, the World Health Organization (WHO) declared the disease a pandemic [1], and as of this writing (in late April 2020), there are more than 2.7 million infected individuals globally.
Figure 1. Chest X-rays (CXRs) showing (a) clear lungs; (b) bacterial pneumonia infection manifesting as consolidations in the right upper lobe and retro-cardiac left lower lobe; and (c) COVID-19 pneumonia infection showing bilateral manifestations. Blue frames in (c) denote radiologist annotations indicating disease regions, which serve as ground truth in our analysis.
While not recommended as a primary diagnostic tool due to the risk of increased transmission, chest radiography and computed tomography (CT) scans are used to screen/confirm respiratory damage in COVID-19 disease and evaluate its progression [3]. CT scans are reported to be less specific than RT-PCR but highly sensitive in detecting COVID-19, and can play a pivotal role in disease diagnosis/treatment [3]. However, the American College of Radiology has recommended against the use of CT scans as a first-line test [5]. Additional considerations of the increased risk of transmission, access, and cost also contribute to the recommendation. When radiological imaging is considered necessary, portable chest X-rays (CXRs) are considered a good and viable alternative [2]. However, in a pandemic situation, the assessment of the images places a huge burden on radiological expertise, which is often lacking in regions with limited resources. Automated decision-making tools could be valuable in alleviating some of this burden, and also as a research tool for quantifying disease progression.
A study of the literature shows that automated computer-aided diagnostic (CADx) tools built with data-driven deep learning (DL) algorithms using convolutional neural networks (CNN) have shown promise in detecting, classifying, and quantifying COVID-19-related disease patterns using CXRs and CT scans [2,3,6], and can serve as a triage aid under resource-constrained settings, thereby facilitating swift referrals for patients who need urgent care. These tools combine elements of radiology and computer vision to learn the hierarchical feature representations from medical images to identify typical disease manifestations and localize suspicious regions of interest (ROI).
It is customary to train and test a DL model with the data coming from the same target distribution
to offer probabilistic predictions toward categorizing the medical images to their respective categories.
Often, this idealized target is not achievable due to limited data availability or weak labels. In the
present situation, despite a large number of cases worldwide, we have very limited COVID-19
CXR image data that are publicly available to train DL models where the goal is to recognize CXR
images showing COVID-19-related viral pneumonia from those caused by other non-COVID-19 viral,
bacterial, and other pathogens. Acquiring such data remains a goal for medical societies such as the
Radiological Society of North America (RSNA) [7] and Imaging COVID-19 AI Initiative in Europe [8].
A large amount of training data enables a diversified feature space across categories that helps to enhance inter-class variance, leading to better DL performance. The absence of such data leads to
model overfitting and poor generalization to unseen real-world data. Under these circumstances, data
augmentation has been proven to be effective in training discriminative DL models [9]. There are several
data augmentation methods discussed in the literature for improving performance in natural computer
vision tasks. These include traditional augmentation techniques like flipping, rotations, color jittering,
random cropping, and elastic distortions and generative adversarial networks (GAN)-based synthetic
data generation [10]. Other methods such as random image cropping and patching (RICAP) [11]
are proposed for natural images to augment the training data to achieve superior performance on
CIFAR-100 and ImageNet classification tasks.
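For context, the traditional transforms listed above can be sketched with the Keras ImageDataGenerator; the parameter values below are illustrative assumptions, not those used in any cited study.

```python
# A sketch of a traditional augmentation pipeline of the kind cited above;
# all ranges are illustrative.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

augmenter = ImageDataGenerator(
    horizontal_flip=True,          # flipping
    rotation_range=10,             # small rotations (degrees)
    brightness_range=(0.9, 1.1),   # mild intensity jitter
    width_shift_range=0.05,        # random translations, akin to cropping
    height_shift_range=0.05,
)
# flow_from_directory would yield augmented batches during training, e.g.:
# train_iter = augmenter.flow_from_directory("cxr/train", target_size=(256, 256))
```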
Unlike natural images, such as those found in ImageNet [12], medical images tend to have different
visual characteristics exhibiting high inter-class similarities and highly localized ROI. Under these
circumstances, traditional augmentation methods that introduce simple pixel-wise image modifications
are shown to be less effective [13]. On the other hand, GAN-based DL models that are used for
synthetic data generation are computationally complex and the jury is still out on the anatomical and
pathological validity of synthesized images. These networks are hard to train due to the problem of
Nash equilibria, defined as the zero-sum game between the generator and the discriminator networks,
where they contest with each other in improving performance [14]. Further, these networks are shown
to be sensitive to the selection of architecture and hyperparameters and often get into mode collapse,
resulting in a degraded performance [14]. In general, there is a great opportunity for research in
developing effective data augmentation strategies for medical visual recognition tasks. Goals for such
medical data augmentation techniques include reducing overfitting and regularization errors in a
data-scarce situation. The urgency offered by the pandemic has led to the motivation behind this study.
In this work, we use weakly labeled CXR images that are automatically pooled from publicly
available pneumonia datasets to augment training data toward classifying them into bacterial and viral
pneumonia classes and compare the performance with non-augmented training. The goal is to improve
COVID-19 detection in CXRs on the hypothesis that it is a kind of viral pneumonia. This would
leverage the large collections of images toward meeting an emergent goal.
(ii) RSNA CXR dataset [15]: A collection of CXRs with pneumonia-related opacity patterns, taken from the NIH CXR-14 dataset [16]. It includes 6012 CXRs showing pneumonia-related opacities, with ground truth (GT) bounding box annotations for these on 1241 CXRs;
(iii) CheXpert CXR dataset [17]: A subset of 4683 CXRs showing pneumonia-related opacities
selected from a collection of 223,648 CXRs in frontal and lateral projections, collected from 65,240
patients at Stanford Hospital, California, and labeled for 14 thoracic diseases by extracting the
labels from radiological texts using an automated natural language processing (NLP)-based labeler,
conforming to the glossary of the Fleischner Society;
(iv) NIH CXR-14 dataset [16]: A subset of 307 CXRs showing pneumonia-related opacities selected
from a collection of 112,120 CXRs in frontal projection, collected from 30,805 patients. Images are
labeled with 14 thoracic disease labels extracted automatically from radiological reports using an
NLP-based labeler;
(v) Twitter COVID-19 CXR dataset: A collection of 135 CXRs showing COVID-19-related viral
pneumonia, collected from SARS-CoV-2-positive subjects, has been made available by a cardiothoracic
radiologist from Spain via Twitter (https://twitter.com/ChestImaging). The images are made available
in JFIF format at approximately a 2K × 2K resolution;
(vi) Montreal COVID-19 CXR dataset: As of 14 April 2020, a collection of 179 SARS-CoV-2-positive
CXRs and others showing non-COVID-19 viral disease manifestations has been made publicly available
by the authors of [18] in their GitHub repository. The CXRs are made available in AP and PA projections.
Tables 1–3 show the distribution of the data used toward the baseline training and evaluation,
weak-label augmentation, and COVID-19 classification, respectively. The GT disease bounding box annotations for a sample of the COVID-19 CXR data, comprising 27 CXRs drawn collectively from the Twitter COVID-19 and Montreal COVID-19 CXR collections, were provided by an expert radiologist who verified the publicly identified cases and annotated the sample test collection.
Table 1. Baseline dataset characteristics. Numerator and denominator denote the number of train and
test data, respectively. Note that this dataset predates the onset of the SARS-CoV-2 virus, and therefore the viral pneumonia is of the non-COVID-19 type.
Figure 2 illustrates the graphical abstract of the proposed study. Broadly, our workflow consisted
of the following steps: First, we preprocessed the images to make them suitable for use in DL. Then, as
shown in Figure 2a, we evaluated the performance of a custom CNN and a selection of pre-trained
CNN models for categorizing the pediatric CXR collection, referred to as baseline, into bacterial or
viral pneumonia. The trained model was further evaluated for its ability to categorize the publicly
available COVID-19 CXR collections as showing viral pneumonia. Next, as shown in Figure 2b, we used
the trained model from Figure 2a to weakly label CXRs as showing bacterial or viral pneumonia in
other pneumonia datasets (RSNA, CheXpert, and NIH). Then, as shown in Figure 2c, the baseline
training data were augmented with these weakly labeled CXRs to improve the detection performance with both (i) the baseline test data and (ii) the COVID-19 CXR collections.
Figure 2. Graphical abstract of the proposed study. (a) Model training and evaluation with baseline pediatric CXR data; (b) using the best performing model from (a) to weakly classify CXRs from Radiological Society of North America (RSNA), National Institutes of Health (NIH), and CheXpert containing pneumonia-related opacities, as showing bacterial or viral pneumonia; and (c) augmenting the baseline training data with weakly labeled CXRs to check for performance improvement.
This discriminative training data augmentation strategy recognizes biological similarity in viral and COVID-19 pneumonia, i.e., both are viral; however, it also notes the distinct radiological manifestations between each other as well as with non-viral pneumonia-related opacities. Rejects from the classifier developed in this study are not necessarily normal and should be subjected to a separate clinical assessment.

2.2. Lung ROI Segmentation and Preprocessing

It is important to add controls during the training of the data-driven DL methods for disease screening/diagnosis. Learning irrelevant feature representations could adversely impact the clinical decision-making. To assist the DL model to focus on pulmonary abnormalities, we used a dilated dropout U-Net to generate lung masks and crop the CXRs to the lung ROI, as shown in Figure 3.
Figure 3. The segmentation approach showing dilated dropout U-Net-based mask generation and lung ROI cropping.
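A compact sketch of a dilated dropout U-Net of the kind shown in Figure 3 follows; the depth, filter counts, and dilation rates are our illustrative assumptions rather than the exact design used in this study.

```python
# A minimal dilated dropout U-Net sketch for lung mask generation.
from tensorflow.keras import layers, models

def conv_block(x, filters, dilation=1, dropout=0.3):
    # Two dilated convolutions followed by dropout ("dilated dropout" block).
    for _ in range(2):
        x = layers.Conv2D(filters, 3, padding="same",
                          dilation_rate=dilation, activation="relu")(x)
    return layers.Dropout(dropout)(x)

def build_dilated_unet(input_shape=(256, 256, 1)):
    inputs = layers.Input(shape=input_shape)
    skips, x = [], inputs
    # Encoder: progressively larger dilation rates widen the receptive field.
    for filters, rate in [(32, 1), (64, 2), (128, 4)]:
        x = conv_block(x, filters, dilation=rate)
        skips.append(x)
        x = layers.MaxPooling2D(2)(x)
    x = conv_block(x, 256, dilation=8)
    # Decoder: upsample and concatenate the matching encoder features.
    for filters, skip in zip([128, 64, 32], reversed(skips)):
        x = layers.Conv2DTranspose(filters, 2, strides=2, padding="same")(x)
        x = layers.Concatenate()([x, skip])
        x = conv_block(x, filters)
    outputs = layers.Conv2D(1, 1, activation="sigmoid")(x)  # binary lung mask
    return models.Model(inputs, outputs)
```

The predicted mask's bounding box is then used to crop the CXR to the lung ROI before classification.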
Additional preprocessing steps performed were as follows: (i) CXRs were thresholded to remove very bright pixels, such as text annotations, that might be present in the cropped images (the threshold was empirically determined to be in the range (235–255)). Missing pixels were in-painted using the surrounding pixel values. (ii) Images were normalized to make the pixel values lie in the range (0–1). (iii) CXR images were median-filtered to remove noise and preserve edges. (iv) Image pixel values were centered and standardized to reduce the computational complexity. Next, the cropped CXRs were used to train and evaluate a custom CNN and a selection of pretrained models at the different learning stages performed in this study.
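The four steps can be sketched as follows with OpenCV and NumPy; the in-painting radius and the median-filter kernel size are assumed values, while the (235–255) threshold comes from the text.

```python
# A sketch of preprocessing steps (i)-(iv); `image` is assumed to be a
# uint8 grayscale CXR.
import cv2
import numpy as np

def preprocess_cxr(image):
    # (i) In-paint very bright pixels (235-255), e.g., burned-in text labels.
    mask = (image >= 235).astype(np.uint8)
    image = cv2.inpaint(image, mask, inpaintRadius=3, flags=cv2.INPAINT_TELEA)
    # (ii) Normalize pixel values to the range (0-1).
    image = image.astype(np.float32) / 255.0
    # (iii) Median-filter to suppress noise while preserving edges.
    image = cv2.medianBlur(image, 3)
    # (iv) Center and standardize the pixel values.
    return (image - image.mean()) / (image.std() + 1e-8)
```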
2.3. Models and Computational Resources
The performance of a custom CNN model whose design is inspired by the wide residual network (WRN) architecture proposed in [22] and a selection of ImageNet pretrained CNN models was evaluated during the different stages of learning performed in this study. The benefit of using a WRN compared with the traditional residual networks (ResNets) [23] is that it is shallower, resulting in shorter training times while producing similar or improved accuracy. In this study, we used a WRN-based custom CNN architecture with dropouts used in every residual block. After pilot empirical evaluations, we used a network depth of 28, a width of 10, and a dropout ratio of 0.3 for the custom WRN used in this study.
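A minimal Keras sketch of such a WRN-28-10 with dropout in every residual block is given below; details such as the initial filter count and the pre-activation block layout are assumptions, not the authors' exact configuration.

```python
# A sketch of a WRN-28-10 with dropout in every residual block.
from tensorflow.keras import layers, models

def residual_block(x, filters, stride, dropout=0.3):
    # Pre-activation residual block with dropout between the two convolutions.
    h = layers.BatchNormalization()(x)
    h = layers.Activation("relu")(h)
    h = layers.Conv2D(filters, 3, strides=stride, padding="same")(h)
    h = layers.Dropout(dropout)(h)
    h = layers.BatchNormalization()(h)
    h = layers.Activation("relu")(h)
    h = layers.Conv2D(filters, 3, strides=1, padding="same")(h)
    # Project the shortcut when the spatial size or channel count changes.
    if stride != 1 or x.shape[-1] != filters:
        x = layers.Conv2D(filters, 1, strides=stride, padding="same")(x)
    return layers.Add()([x, h])

def build_wrn(input_shape=(256, 256, 3), depth=28, width=10,
              dropout=0.3, num_classes=2):
    n = (depth - 4) // 6  # depth 28 -> 4 residual blocks per group
    inputs = layers.Input(shape=input_shape)
    x = layers.Conv2D(16, 3, padding="same")(inputs)
    for group, filters in enumerate([16 * width, 32 * width, 64 * width]):
        for block in range(n):
            stride = 2 if (group > 0 and block == 0) else 1
            x = residual_block(x, filters, stride, dropout)
    x = layers.BatchNormalization()(x)
    x = layers.Activation("relu")(x)
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Dense(num_classes, activation="softmax")(x)
    return models.Model(inputs, outputs)
```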
We evaluated the performance of the following pretrained CNN models, viz., (a) VGG-16 [24], (b) Inception-V3 [25], (c) Xception [26], (d) DenseNet-121 [27], and (e) NasNet-mobile [28]. The pretrained CNNs were instantiated with their ImageNet [12] pretrained weights and truncated at their fully connected layers. The output feature maps were global average-pooled and fed to a final dense layer with Softmax activations to output the prediction probabilities.

The following hyperparameters of the custom WRN and pretrained CNNs were optimized through a randomized grid search method: (i) momentum, (ii) L2-weight decay, and (iii) initial learning rate of the stochastic gradient descent (SGD) optimizer. We initialized the search ranges to (0.80–0.99), (1 × 10−8–1 × 10−2), and (1 × 10−7–1 × 10−3) for the learning momentum, L2-weight decay, and initial learning rate, respectively. The custom WRN was initialized with random weights and the pretrained models were fine-tuned end-to-end with smaller weight updates to make them data-specific and classify the CXRs to their respective categories. Callbacks were used to monitor the model performance with the validation data and store the best model weights for further analysis with the hold-out test data.
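This setup can be sketched as follows for the VGG-16 case: a hyperparameter draw from the stated search ranges, the truncated backbone with a global average-pooled Softmax head, and a checkpoint callback. The log-uniform sampling, file names, and the commented epoch count are illustrative assumptions.

```python
# A sketch of the fine-tuning and randomized search setup described above.
import numpy as np
from tensorflow.keras import layers, models, optimizers, callbacks, regularizers
from tensorflow.keras.applications import VGG16

def sample_hyperparameters(rng):
    # Draws over the search ranges reported in the text (log-uniform assumed).
    return {
        "momentum": rng.uniform(0.80, 0.99),
        "l2_decay": 10 ** rng.uniform(-8, -2),
        "init_lr": 10 ** rng.uniform(-7, -3),
    }

def build_vgg16_classifier(l2_decay, num_classes=2, input_shape=(256, 256, 3)):
    # Backbone truncated at its fully connected layers, as described above.
    base = VGG16(weights="imagenet", include_top=False, input_shape=input_shape)
    x = layers.GlobalAveragePooling2D()(base.output)
    outputs = layers.Dense(num_classes, activation="softmax",
                           kernel_regularizer=regularizers.l2(l2_decay))(x)
    return models.Model(base.input, outputs)

rng = np.random.default_rng(0)
hp = sample_hyperparameters(rng)
model = build_vgg16_classifier(hp["l2_decay"])
model.compile(optimizer=optimizers.SGD(learning_rate=hp["init_lr"],
                                       momentum=hp["momentum"]),
              loss="categorical_crossentropy", metrics=["accuracy"])
# The callback stores the best weights observed on the validation split.
ckpt = callbacks.ModelCheckpoint("best_vgg16.h5", monitor="val_loss",
                                 save_best_only=True)
# model.fit(train_data, validation_data=val_data, epochs=32, callbacks=[ckpt])
```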
The performances of the custom WRN and the pretrained CNN models were evaluated in terms of (i) accuracy, (ii) area under the curve (AUC), (iii) sensitivity or recall, (iv) specificity, (v) precision, (vi) F-score, and (vii) Matthews correlation coefficient (MCC). The models were trained and evaluated on a Windows system with an Intel Xeon 3.80 GHz CPU, 32 GB of RAM, and an NVIDIA GeForce GTX 1070 GPU. We used the Keras API (version 2.2.4) with a TensorFlow backend and CUDA/CUDNN dependencies.
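As a sketch, the listed metrics can be computed with scikit-learn as below; `y_true` and `y_prob` are assumed to be binary labels and positive-class probabilities.

```python
# A small helper computing the seven evaluation metrics listed above.
import numpy as np
from sklearn.metrics import (accuracy_score, roc_auc_score, recall_score,
                             precision_score, f1_score, matthews_corrcoef,
                             confusion_matrix)

def evaluate(y_true, y_prob, threshold=0.5):
    y_pred = (np.asarray(y_prob) >= threshold).astype(int)
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return {
        "accuracy": accuracy_score(y_true, y_pred),
        "auc": roc_auc_score(y_true, y_prob),
        "sensitivity": recall_score(y_true, y_pred),  # recall
        "specificity": tn / (tn + fp),
        "precision": precision_score(y_true, y_pred),
        "f_score": f1_score(y_true, y_pred),
        "mcc": matthews_corrcoef(y_true, y_pred),
    }
```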
3. Results
Table 4 shows the optimal hyperparameter values obtained using a randomized grid search for
the custom WRN and pretrained CNNs. These are used for the model training and evaluation. For the model validation, we randomly allocated 20% of the training data. The performance achieved by the models is shown in Table 5.
It can be observed that the VGG-16 model demonstrates superior performance in terms of accuracy
and AUC with the baseline test data. The Xception model gives higher precision and specificity
than the other models. However, the VGG-16 model outperformed the others in classifying the
pediatric CXRs as showing bacterial or viral pneumonia when considering the F-score and MCC. Both
these scores provide a balanced precision and sensitivity measure. The performance excellence of
the VGG-16 model can be attributed to (i) the optimal architecture depth for learning the data and
(ii) the ability to extract diversified features that categorize the CXRs to their respective categories.
These deductions are supported by the reduced performance of deeper models like DenseNet-121
which possibly suffered from overfitting. Therefore, we selected the VGG-16 model for further evaluation against the Twitter-COVID-19 and Montreal-COVID-19 CXR collections as showing viral pneumonia.
The performance achieved is shown in Table 6. Figure 4 shows the confusion matrix obtained toward
classifying the Twitter- and Montreal-COVID-19 CXR collections as showing viral pneumonia.
Table 4. Optimal values for the hyperparameters for the custom wide residual network (WRN) and
pretrained convolutional neural networks (CNNs) obtained through the randomized grid search
(M: momentum, ILR: initial learning rate, and L2: L2-weight decay).
Models        M       ILR        L2
Custom        0.90    1 × 10−3   1 × 10−5
Pretrained    0.95    1 × 10−3   1 × 10−6
Table 5. Performance achieved by the deep learning (DL) models in classifying the pediatric CXR
dataset (baseline) into bacterial and viral categories. Here, Acc.: accuracy, Sens.: sensitivity, Prec.:
precision, F: F-score, and MCC: Matthews correlation coefficient.
Table 6. Performance metrics achieved in classifying the Twitter- and Montreal-COVID-19 CXR
collections as showing viral pneumonia.
Model     Twitter-COVID-19 (Acc.)    Montreal-COVID-19 (Acc.)
VGG-16    0.2885                     0.5028
It was surprising to observe, from Table 6 and Figure 4, that the baseline-trained VGG-16 model did
not deliver superior performance in identifying COVID-19 CXRs in the Twitter- and Montreal-COVID-19
CXR collections. We attribute this to two possibilities: (i) limited variance in the training distribution
and hence a narrow feature space to learn the related patterns; or (ii) that COVID-19 manifestation is
distinct from viral pneumonia even though it is caused by the SARS-CoV-2 virus.
The learned behavior of the baseline-trained VGG-16 model with the pediatric CXR and COVID-19
CXR collections is interpreted through Grad-CAM visualizations and is shown in Figure 5.
Figure 4. Confusion matrix after classifying bacterial and viral pneumonia in the (a) Twitter-COVID-19 and (b) Montreal-COVID-19 CXR collections. Enlarged text labels have been manually superimposed for clarity.
Figure 5. Original CXRs and their salient ROI visualization: (a,b) show a CXR with bilateral bacterial pneumonia and the corresponding Grad-CAM visualization; (c,d) show a CXR with viral pneumonia manifestations and the corresponding salient ROI visualization; and (e,f) show a sample CXR from the Montreal-COVID-19 CXR collection with ground truth (GT) annotations and corresponding salient ROI visualization. Blue frames in (e) denote radiologist annotations indicating disease regions, which serve as ground truth in our analysis.
The gradients for the bacterial and viral pneumonia classes that are flowing into the deepest convolutional layer of the trained model are used to interpret the neurons involved in the decision-making. The heat maps obtained as a result of weighing these feature maps are superimposed on the original CXRs to identify the salient ROI involved in categorizing the CXRs to their respective classes. It is observed that the model is correctly focusing on the salient ROI for the baseline test data coming from the same training distribution that helps to categorize them into bacterial and viral pneumonia classes. However, the salient ROI involved in categorizing an image from the Montreal-COVID-19 CXR collection that comes from a different distribution compared with the baseline data did not properly overlap with the GT annotations. This further underscores the inference above that the model did not learn the disease manifestations in the aforementioned COVID-19 CXR collections, suggesting that their appearances are distinct.

With data-driven DL methods, the training data may contain samples that do not contribute to decision-making. Modifying the training distribution could provide an active solution to improve performance with a similar and/or different test distribution. In response, our approach is to expand the training data feature space to create a diversified distribution that could help learn and improve the performance with the baseline test data coming from the same distribution as the training data and/or with other test data coming from a different distribution. In this study, we propose to expand the training data feature spaces by augmenting them with weakly classified CXR images. For this, the best-performing, baseline-trained VGG-16 model is used to weakly classify the CXR images from the NIH, RSNA, and CheXpert collections showing pneumonia-related opacities as showing bacterial
or viral pneumonia. The weakly labeled images are further used to augment the baseline training
data to evaluate for an improvement in performance toward categorizing the pediatric CXR test,
Twitter-COVID-19, and Montreal-COVID-19 CXR collections. Table 7 shows the number of samples
across the bacterial and viral pneumonia categories after augmenting the baseline pediatric CXR
training data with weakly labeled images from the respective CXR collections. The performance
metrics achieved with the augmented training data are shown in Table 8.
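A minimal sketch of this weak-labeling step is given below; the array names and the class ordering are assumptions for illustration.

```python
# Weak labeling: the baseline-trained model's predicted class on pooled
# CXRs becomes the (weak) training label for augmentation.
import numpy as np

def weakly_label(model, pooled_images, class_names=("bacterial", "viral")):
    probs = model.predict(pooled_images)   # (n, 2) softmax outputs
    labels = probs.argmax(axis=1)          # weak label per CXR
    return [(img, class_names[lab]) for img, lab in zip(pooled_images, labels)]

# The weakly labeled images are then appended to the baseline training set:
# augmented_x = np.concatenate([baseline_x, pooled_images])
# augmented_y = np.concatenate([baseline_y, labels])
```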
Table 7. Number of samples across the bacterial pneumonia (BP) and viral pneumonia (VP) categories after augmenting the baseline training data with weakly labeled CXRs.

Dataset                             BP      VP
Baseline + NIH                      2720    1470
Baseline + CheXpert                 4683    3883
Baseline + RSNA                     6577    3318
Baseline + NIH + CheXpert           4865    4008
Baseline + NIH + RSNA               6759    3443
Baseline + CheXpert + RSNA          8722    5856
Baseline + NIH + CheXpert + RSNA    8904    5981
Table 8. Performance metrics achieved with the different combinations of augmented training data
toward classifying the pediatric CXR (baseline) test data into bacterial and viral pneumonia categories.
Note that the baseline training data augmented with weakly labeled CXR images from the
CheXpert CXR collection demonstrated superior performance in all metrics compared with the
non-augmented and other training data augmentations. This underscores the fact that this augmentation
approach resulted in a favorable increase in the training data size, encompassing a diversified
distribution to learn and improve the performance in the baseline test data, compared with that of the
non-augmented training. We studied the effect of weakly labeled data augmentation in classifying
the Twitter- and Montreal-COVID-19 CXR collections as belonging to the viral pneumonia category.
The results are as shown in Table 9.
The empirical evaluations demonstrate that the baseline training data augmented with the weakly
labeled CXR images from the CheXpert collection improved the performance with an accuracy of
0.5555 and 0.6536, as compared with the non-augmented baseline (0.2885 and 0.5028) in classifying the
Twitter- and Montreal-COVID-19 CXR collection, respectively, as belonging to the viral pneumonia
category. The performance degradation with other combinations of weakly labeled data augmentation
underscores the fact that (i) adding more data can introduce noise into the training process and (ii) increasing the number of training samples does not always improve performance.
Table 9. Performance metrics achieved through weakly labeled data augmentation toward classifying
the Twitter- and Montreal-COVID-19 CXR collections as belonging to the viral pneumonia category.
Figure 6. Evaluating the performance against a collection of COVID-19 CXRs when augmenting the best-performing weakly labeled augmented training data with a different collection of COVID-19 CXRs.
Table 10. Performance metrics achieved by augmenting the best-performing weakly labeled augmented training data with one of the COVID-19 CXR collections toward classifying the other COVID-19 CXR collection as belonging to the viral pneumonia category.
Dataset                           Twitter-COVID-19 (Acc.)    Montreal-COVID-19 (Acc.)
Baseline                          0.2885                     0.5028
Baseline + CheXpert               0.5555                     0.6536
Baseline + CheXpert + Twitter     -                          0.7095
Baseline + CheXpert + Montreal    0.8889                     -

Bold numerical values denote superior performance.
The results in Table 10 support our hypothesis that augmenting the best-performing weakly labeled augmented training data with class-specific COVID-19 data, since these data are sufficiently distinct, is necessary to obtain an improvement. We are intrigued by the disparity in improvement, however. Recall that the
Twitter-COVID-19 collection was posted from a hospital in Spain. In contrast, the Montreal-COVID-19
collection is sourced broadly and does not typify the pneumonia opacity from a select population.
Thus, the variety introduced by augmenting with the Montreal-COVID-19 data results in a much
greater boost in performance as compared with Twitter-COVID-19.
Next, to test the degree to which COVID-19 is distinct from routine viral pneumonia manifestations,
we augmented the baseline directly with the individual COVID-19 images.
It is observed from Table 11 that augmenting the baseline training data with the Twitter-COVID-19
CXR collection significantly improved the performance in detecting COVID-19 CXRs in the Montreal
collection as belonging to the viral pneumonia category. We observed similar improvements in
performance with the Twitter-COVID-19 CXRs when the baseline training data is augmented with
the Montreal-COVID-19 CXR collection. This suggests that weakly labeled augmentation might be
hurting rather than helping the detection of COVID-19. While this may seem counter to our original
hypothesis, recall that weakly labeled augmentation is very valuable when there are insufficient data
for a subclass. This is supported by the results shown in Table 8 above. In the case of COVID-19,
note that the collections are very small and need some additional training images. Therefore, these
augmented training images must be selected wisely.
Table 11. Performance metrics achieved by augmenting the baseline training data directly with one of the COVID-19 CXR collections toward classifying the other COVID-19 CXR collection as belonging to the viral pneumonia category.
Dataset                         Twitter-COVID-19 (Acc.)    Montreal-COVID-19 (Acc.)
Baseline                        0.2885                     0.5028
Baseline + Twitter-COVID-19     -                          0.9778
Baseline + Montreal-COVID-19    0.9926                     -

Bold numerical values denote superior performance.
Confusion matrices for the results in Table 11 above are shown in Figure 7, while Figure 8 shows
the learned behavior of the trained model. We observe that the learned interpretation is correctly
focusing on the salient ROI, matching with the GT annotations that help to categorize COVID-19 CXRs
as showing viral pneumonia. This is a significant improvement over the non-augmented training
results shown in Figure 5.
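For reference, the Grad-CAM computation [29] behind these visualizations can be sketched as follows; the layer name "block5_conv3" assumes a VGG-16 backbone.

```python
# Grad-CAM sketch: gradients of a class score flowing into the deepest
# convolutional layer weight its feature maps; the weighted sum is then
# normalized for overlay on the input CXR.
import numpy as np
import tensorflow as tf
from tensorflow.keras import models

def grad_cam(model, image, class_index, conv_layer="block5_conv3"):
    # Map the input to both the target conv activations and the predictions.
    grad_model = models.Model(model.input,
                              [model.get_layer(conv_layer).output, model.output])
    with tf.GradientTape() as tape:
        conv_out, preds = grad_model(image[np.newaxis, ...])
        class_score = preds[:, class_index]
    grads = tape.gradient(class_score, conv_out)
    # Global-average-pool the gradients to get one weight per feature map.
    weights = tf.reduce_mean(grads, axis=(1, 2))
    cam = tf.reduce_sum(conv_out * weights[:, tf.newaxis, tf.newaxis, :], axis=-1)
    cam = tf.nn.relu(cam)[0]             # keep positively contributing regions
    cam /= (tf.reduce_max(cam) + 1e-8)   # normalize to (0-1) for overlay
    return cam.numpy()
```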
Figure 7. Confusion matrix after classifying bacterial and viral pneumonia in the (a) Twitter- and (b) Montreal-COVID-19 CXR collections after augmenting the baseline training data with individual COVID-19 CXR collections. Enlarged text labels have been manually superimposed for clarity.
Figure 8. Original CXRs, heat maps, and salient ROI visualization: (a–c) show a sample Montreal-COVID-19 CXR with GT annotations, the corresponding heat map, and the Grad-CAM visualization; (d–f) show a sample Twitter-COVID-19 CXR with GT annotations, the heat map, and its associated class activation maps. Blue frames in (a,d) denote radiologist annotations indicating disease regions, which serve as ground truth in our analysis.
References
1. World Health Organization (WHO). Coronavirus Disease (COVID-2019) Situation Reports. Available
online: https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports (accessed on
2 March 2020).
2. Rubin, G.D.; Ryerson, C.J.; Haramati, L.B.; Sverzellati, N.; Kanne, J.P.; Raoof, S.; Schluger, N.W.; Volpi, A.;
Yim, J.-J.; Martin, I.B.K.; et al. The Role of Chest Imaging in Patient Management during the COVID-19
Pandemic: A Multinational Consensus Statement from the Fleischner Society. Radiology 2020, 201365.
[CrossRef] [PubMed]
3. Bai, H.X.; Hsieh, B.; Xiong, Z.; Halsey, K.; Choi, J.W.; Tran, T.M.L.; Pan, I.; Shi, L.-B.; Wang, D.-C.; Mei, J.; et al.
Performance of radiologists in differentiating COVID-19 from viral pneumonia on chest CT. Radiology 2020,
200823. [CrossRef] [PubMed]
4. Kermany, D.S.; Goldbaum, M.; Cai, W.; Valentim, C.C.; Liang, H.; Baxter, S.L.; McKeown, A.; Yang, G.; Wu, X.;
Yan, F.; et al. Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell
2018, 172, 1122–1131.e9. [CrossRef] [PubMed]
5. ACR Recommendations for the use of Chest Radiography and Computed Tomography (CT) for Suspected
COVID-19 Infection. Available online: https://www.acr.org/Advocacy-and-Economics/ACR-Position-
Statements/Recommendations-for-Chest-Radiography-and-CT-for-Suspected-COVID19-Infection
(accessed on 12 March 2020).
6. Li, L.; Qin, L.; Xu, Z.; Yin, Y.; Wang, X.; Kong, B.; Bai, J.; Lu, Y.; Fang, Z.; Song, Q.; et al. Artificial Intelligence
Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT. Radiology 2020, 200905.
[CrossRef] [PubMed]
7. RSNA Announces COVID-19 Imaging Data Repository. Available online: https://press.rsna.org/timssnet/
media/pressreleases/14_pr_target.cfm?ID=2167 (accessed on 1 April 2020).
8. A European Initiative for Automated Diagnosis and Quantitative Analysis of COVID-19 on Imaging.
Available online: https://imagingcovid19ai.eu/ (accessed on 2 April 2020).
9. Perez, L.; Wang, J. The Effectiveness of Data Augmentation in Image Classification using Deep Learning.
arXiv 2017, arXiv:1712.04621.
10. Ganesan, P.; Rajaraman, S.; Long, R.; Ghoraani, B.; Antani, S. Assessment of Data Augmentation Strategies
Toward Performance Improvement of Abnormality Classification in Chest Radiographs. In Proceedings of
the 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC);
Institute of Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2019; Volume 2019, pp. 841–844.
11. Takahashi, R.; Matsubara, T.; Uehara, K. Data Augmentation using Random Image Cropping and Patching
for Deep CNNs. IEEE Trans. Circuits Syst. Video Technol. 2020, 1. [CrossRef]
12. Deng, J.; Dong, W.; Socher, R.; Li, L.; Li, K.; Li, F.-F. ImageNet: A large-scale hierarchical image database.
In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA,
20–25 June 2009; pp. 248–255.
13. Ben-Cohen, A.; Klang, E.; Amitai, M.M.; Goldberger, J.; Greenspan, H. Anatomical data augmentation for
CNN based pixel-wise classification. In Proceedings of the 2018 IEEE 15th International Symposium on
Biomedical Imaging (ISBI 2018), Washington, DC, USA, 4–7 April 2018; pp. 1096–1099. [CrossRef]
14. Goodfellow, I. Nips 2016 tutorial: Generative adversarial networks. arXiv 2016, arXiv:1701.00160.
15. Shih, G.; Wu, C.C.; Halabi, S.S.; Kohli, M.D.; Prevedello, L.M.; Cook, T.S.; Sharma, A.; Amorosa, J.K.;
Arteaga, V.; Galperin-Aizenberg, M.; et al. Augmenting the National Institutes of Health Chest Radiograph
Dataset with Expert Annotations of Possible Pneumonia. Radiol. Artif. Intell. 2019, 1, e180041. [CrossRef]
16. Wang, X.; Peng, Y.; Lu, L.; Lu, Z.; Bagheri, M.; Summers, R.M. ChestX-Ray8: Hospital-Scale Chest X-Ray
Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases.
In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Institute of
Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2017; pp. 3462–3471.
17. Irvin, J.; Rajpurkar, P.; Ko, M.; Yu, Y.; Ciurea-Ilcus, S.; Chute, C.; Marklund, H.; Haghgoo, B.; Ball, R.;
Shpanskaya, K.; et al. CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert
Comparison. In Proceedings of the AAAI Conference on Artificial Intelligence; Association for
the Advancement of Artificial Intelligence (AAAI): Menlo Park, CA, USA, 2019; Volume 33, pp. 590–597.
18. Cohen, J.P.; Morrison, P.; Dao, L. COVID-19 Image Data Collection 2020. arXiv 2020, arXiv:2003.11597.
19. Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation.
Appl. Evol. Comput. 2015, 9351, 234–241.
20. Yu, F.; Koltun, V. Multi-Scale Context Aggregation by Dilated Convolutions. arXiv 2015, arXiv:1511.07122.
21. Candemir, S.; Antani, S.; Jaeger, S.; Browning, R.; Thoma, G.R. Lung boundary detection in pediatric chest
x-rays. In Proceedings of the Medical Imaging 2015: PACS and Imaging Informatics: Next Generation and
Innovations, Orlando, FL, USA, 21–26 February 2015; Volume 9418, p. 94180Q. [CrossRef]
22. Zagoruyko, S.; Komodakis, N.; Wilson, R.C.; Hancock, E.R.; Smith, W.A.P.; Pears, N.E.; Bors, A.G.
Wide Residual Networks. In Proceedings of the British Machine Vision Conference 2016, York, UK,
19–22 September 2016.
23. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016
IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Institute of Electrical and Electronics
Engineers (IEEE): Piscataway, NJ, USA, 2016; pp. 770–778.
24. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition.
In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA,
7–9 May 2015; pp. 1–14.
25. Szegedy, C.; Vanhoucke, V.; Ioffe, S.; Shlens, J.; Wojna, Z. Rethinking the Inception Architecture for Computer
Vision. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),
Las Vegas Valley, NV, USA, 26 June–1 July 2016; pp. 2818–2826. [CrossRef]
26. Chollet, F. Xception: Deep Learning with Depthwise Separable Convolutions. In Proceedings of the 2017 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR); Institute of Electrical and Electronics Engineers (IEEE):
Piscataway, NJ, USA, 2017; pp. 1800–1807.
27. Huang, G.; Liu, Z.; Van Der Maaten, L.; Weinberger, K.Q. Densely Connected Convolutional Networks.
In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Institute of
Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2017; pp. 2261–2269.
28. Pham, H.; Guan, M.Y.; Zoph, B.; Le, Q.V.; Dean, J. Efficient Neural Architecture Search via Parameter Sharing.
In Proceedings of the International Conference on Machine Learning (ICML), Stockholm, Sweden, 10–15 July 2018;
pp. 4092–4101.
29. Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations
from Deep Networks via Gradient-Based Localization. In Proceedings of the 2017 IEEE International Conference
on Computer Vision (ICCV); Institute of Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2017;
pp. 618–626.
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).