0% found this document useful (0 votes)

12 views

Automated Chest Screening Based On A Hybrid Model of Transfer Learning and Convolutional Sparse Denoising Autoencoder

This document describes a deep learning model for automated chest screening to classify chest x-rays (CXRs) as normal, abnormal, or uncertain. The model uses transfer learning with a convolutional sparse denoising autoencoder to extract features from CXR images. It was trained and validated on four public datasets containing CXR images and reports. The results showed the model could accurately classify CXRs as normal or abnormal with 98.7% and 94.3% precision, respectively. This automated system has the potential to reduce radiologists' workload by prioritizing higher-risk exams.

Uploaded by

Alejandro De Jesus Romo Rosales

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views

Automated Chest Screening Based On A Hybrid Model of Transfer Learning and Convolutional Sparse Denoising Autoencoder

Uploaded by

Alejandro De Jesus Romo Rosales

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Wang

et al. BioMed Eng OnLine (2018) 17:63

https://doi.org/10.1186/s12938-018-0496-2 BioMedical Engineering
OnLine

RESEARCH Open Access

Automated chest screening based

on a hybrid model of transfer learning
and convolutional sparse denoising
autoencoder
Changmiao Wang1,2, Ahmed Elazab3,4, Fucang Jia1, Jianhuang Wu1 and Qingmao Hu1,5*

*Correspondence:
[email protected] Abstract
1
Shenzhen Institutes Objective: In this paper, we aim to investigate the effect of computer-aided triage
of Advanced Technology,
Chinese Academy system, which is implemented for the health checkup of lung lesions involving tens
of Sciences, 1068 Xueyuan of thousands of chest X-rays (CXRs) that are required for diagnosis. Therefore, high
Boulevard, Shenzhen 518055, accuracy of diagnosis by an automated system can reduce the radiologist’s workload
China
Full list of author information on scrutinizing the medical images.
is available at the end of the Method: We present a deep learning model in order to efficiently detect abnormal
article
levels or identify normal levels during mass chest screening so as to obtain the proba-
bility confidence of the CXRs. Moreover, a convolutional sparse denoising autoencoder
is designed to compute the reconstruction error. We employ four publicly available
radiology datasets pertaining to CXRs, analyze their reports, and utilize their images for
mining the correct disease level of the CXRs that are to be submitted to a computer
aided triaging system. Based on our approach, we vote for the final decision from
multi-classifiers to determine which three levels of the images (i.e. normal, abnormal,
and uncertain cases) that the CXRs fall into.
Results: We only deal with the grade diagnosis for physical examination and propose
multiple new metric indices. Combining predictors for classification by using the area
under a receiver operating characteristic curve, we observe that the final decision
is related to the threshold from reconstruction error and the probability value. Our
method achieves promising results in terms of precision of 98.7 and 94.3% based on
the normal and abnormal cases, respectively.
Conclusion: The results achieved by the proposed framework show superiority in
classifying the disease level with high accuracy. This can potentially save the radiolo-
gists time and effort, so as to allow them to focus on higher-level risk CXRs.
Keywords: Chest screening, Computer aided diagnosis, Deep learning, Autoencoder,
Receiver operating characteristic

© The Author(s) 2018. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License
(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium,
provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license,
and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publi
cdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 2 of 19

Background
Chest screening is a basic procedure in radiology for lung disease prediction and
diagnosis. However, such diagnosis is very time consuming and subjective. The turna-
round time is of great importance in radiology as it is an important criterion to evalu-
ate radiologists rather than the quality of their reports [1]. Especially in rural areas,
direct care providers rely mainly on teleradiology for their chest X-rays (CXRs) inter-
pretation. The emphasis on turnaround time can result in sub-standard reports, con-
fusion, misdiagnosis, and gaps in communication with primary care physicians. All
of these can severely and negatively impact patients’ care and may have life-changing
consequences for patients. Our work is inspired by the recent progresses in image
classification and segmentation. The former has substantially improved performance,
largely due to the introduction of ImageNet database [2] and the advances in deep
convolutional neural networks (CNNs) that effectively helps to recognize the images
with a large pool of hierarchical representations. The CNNs can be used successfully
in medical image classification and segmentation [3–6]. Many other techniques on
lung disease detection and classification have been proposed [7–19].
Sufficient performance, no increase in reading time, seamless workflow integration,
regulatory approval, and cost efficiency are the key points to the radiologists [20].
Tataru et al. applied three different neural network models for abnormality detection
in CXR images that are not public released [21]. Our work will focus on classifying
the CXRs as normal or abnormal in order to assist radiologists to move more quickly
and efficiently. We then use the features extracted from deep neural network model
for the classification of abnormalities in the CXRs. That is to say, we will classify the
CXRs at three status levels: obvious abnormal, obvious normal, and uncertainty. Fur-
thermore, a large collection of medical images can be automatically classified to three
main levels, and the uncertainty status level images can be checked carefully by radi-
ologist. A computer-aided triage system can mitigate these issues in several ways.
Firstly, it will allow radiologists to focus their attention immediately on higher-risk
cases. Secondly, it will provide radiologists with more information to help them cor-
rect potential misdiagnoses. The CXRs input to our algorithm will be a digital image
format along with a label stating ‘normal’ or ‘abnormal’.
In recent years, many methods were proposed for CXRs classification. There are
many conventional machine learning methods to study the classification of X-ray
chest radiographs [22, 23], such as based on texture and deformation features [24]
or ensemble methods [25]. In addition, with the development of deep learning tech-
nology, many deep learning method are also applied to the field of medical image
analysis. Yao et al. [26] explored the correlation among the 14 pathologic labels based
on global images in Chest X-ray 14 [27] in order to classify CXRs, and rendering it
as a multi label recognition problem. Using a variant of DenseNet [28] as an image
encoder, they adopted the long-short term memory networks (LSTM) [29] so as to
capture dependencies. Kumar et al. [30] investigated that which loss function is more
suitable for training CNNs from scratch and presented a boosted cascaded CNN for
global image classification. CheXNet [31] is the recent effective method for fine-tun-
ing a 121-layer DenseNet on the global chest X-ray images, which has a modified last
fully-connected layer.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 3 of 19

Table 1 Number of positive cases and negative cases used in training and validation sets
Training set Validation set
Positive Negative Positive Negative

418 1318 179 565

In this work, we demonstrate how to automatically classify CXRs at different status

levels. Four publicly available radiology datasets that contain CXR images and reports
namely are exploited as follows: the OpenI [17] open source literature, JSRT [32], Shen-
zhen Chest X-ray set [33], and NLM-Montgomery County Chest X-ray set [33]. A com-
mon challenge in medical image analysis is the data bias. When considering the whole
population, diseased cases are much less than healthy cases, which is also the case in the
CXRs datasets used. In our study, normal cases account for 75.9% (1883 images) of the
entire dataset (2480 images), and the abnormal cases account for 24.1% (597 images) of
the entire dataset.
Inspired by the ideas introduced in [34, 35], we employ the already trained models
to obtain the high level features and use them to infer the probability of image level.
Then, we train an autoencoder network with the obtained joint image and generate new
image based on decoder network to obtain the reconstruction error. With the combined
metrics including the probability and reconstruction error based on ROC curve, we can
achieve a score for classifying the image level. Finally, we vote for the decision of the
image level using the ensemble method. We can ultimately generate the better and more
accurate image level. It will greatly reduce the workload of medical experts and assist
them to pay more attention to the suspected higher risk chest X-rays.
The outline of this paper is organized as follows. In “Methods”, we present the pre-
processing of the steps for the acquired CXR images, and features extraction is pre-
sented by transfer learning using a classical deep model. Moreover, the convolutional
sparse denoising autoencoder network to reconstruct the image is proposed and image
classification with CNNs, which are fine-tuned with extracted features. We report the
experimental results and discussions in “Results” and “Discussion”, respectively. Finally,
conclusions and the future work are summarized in “Conclusion”.

Methods
This section discusses the datasets, preprocessing, lung segmentation, feature extrac-
tion, and classification steps. The convolutional sparse denoising autoencoder (CSDAE)
is proposed to get the reconstruction error for classifying CXRs. Moreover, image clas-
sification with CNNs can allow us to obtain the probability in order to classify CXRs.
Finally, the CXRs are classified by combining predictors based on the ROC curves.

Datasets
To ensure the robustness of our method, we test the proposed technique on a total of
2480 images including 1883 negative cases and 597 positive cases from four public data-
sets, namely JSRT, OpenI, SZCX, and MC. The data is randomly split into 70% for train-
ing set and 30% validation set. The number of positive cases and negative cases in the
training set and validation set are summarized in Table 1.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 4 of 19

Next, we give detailed description of every datasets as follows.

Japanese Society of Radiological Technology (JSRT)

A standard database that is acquired from the JSRT. The detailed information of JSRT
can be assessed in [32]. This dataset contains 154 cases with confirmed lung nodules
by CT and 93 normal cases without nodules. The images have been digitized to 12-bit
based on 2048 × 2048 pixels and at a resolution of 0.175 mm per pixel. The nodule size
ranges from 5 to 40 mm in diameter.

OpenI
Generally, we examine the posteroanterior views through the chest of the subject from
back to front. Hence, we only select the frontal images (numbering 3812) in OpenI
dataset that has totally 7470 images including 3658 as lateral images. Only 1433 images
including 49 cases with nodule and tuberculosis, and 1384 normal cases are selected
from the frontal images. The images have following different resolutions (in pixels):
512 × 624, 512 × 420, and 512 × 512.

China set—The Shenzhen set—chest X‑ray database (SZCX)

The standard digital image database for Tuberculosis is created by the National Library
of Medicine, Maryland, USA, in collaboration with Shenzhen No. 3 People’s Hospital,
Guangdong Medical College, Shenzhen, China [33]. The Chest X-rays are from out-
patient clinics and had been captured as part of the daily routine using Philips DR Digi-
tal Diagnose systems. There are 336 cases with manifestation of tuberculosis and 326
normal cases. The format of this dataset is PNG where the image size varies for each
X-ray with approximately 3000 × 3000 pixels. This set includes some pediatric images.
As a result, the image sizes in the dataset vary: image width: minimum is 1130 pixels
and maximum is 3001 pixels; for the image height, the minimum is 948 pixels and the
maximum is 3001 pixels. The resolutions range from 1024 × 1024 to around 2480 × 2480
pixels. The dataset is publicly available in: http://archive.nlm.nih.gov/repos/chestImage
s.php.

Montgomery‑County chest X‑ray database (MC)

The standard digital image database for Tuberculosis is created by the National Library
of Medicine in collaboration with the Department of Health and Human Services, Mont-
gomery County, Maryland, USA [33]. This image set contains data from X-rays collected
under Montgomery County’s Tuberculosis screening program. There are 58 cases with
manifestation of tuberculosis and 80 normal cases. Its format is PNG and the matrix
size is 4020 × 4892 or 4892 × 4020 pixels. The pixel spacing in vertical and horizontal
directions is 0.0875 mm while the number of gray levels is 12 bits. The dataset is publicly
available in: http://archive.nlm.nih.gov/repos/chestImages.php.

Preprocessing
Preprocessing is a preliminary stage in designing computer aided diagnosis (CAD) sys-
tems. Its main goal is to enhance the characteristics of the image that can help to improve
performance in subsequent stages. There are many sources of variance in the CXRs data,
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 5 of 19

which may negatively affect the performance of downstream classification tasks using
feature-based methods or neural networks. We first process all images based on histo-
gram equalization in order to increase contrast within each CXRs image. After that, we
label images that have disease as positive cases, and all other normal image as negative
cases. We randomly split the entire dataset into 70% for training and the remaining 30%
for testing. Before inputting the image into the network for extraction of the high-level
features, we crop the region from the original image, and then down-sample every image
to a 512 × 512 matrix of pixels.
Our chest screening system consists of several modules in the processing pipeline
(Fig. 1). The first module aims to perform preprocessing on the chest radiographs. Due
to the high effect of the outside of lung, we segment the region of lung using U-net [36]
and achieve the promising dice and intersection-over-union (IoU) in the second module.
High level features are extracted using transfer learning from pre-trained deep network
model in the third module, which are used for training the classifier so as to give the
confidence of the status level pertaining to the image. The fourth module is to recon-
struct the image with the CSDAE. Finally, the two metric indices are used to determine
the final decision on the level of the image.
The development of such systems has so far been hindered by immense variance of
CXRs images and lack of labeled data. The normal versus abnormal ratio is 3.15:1. To
reduce the impact of the data imbalance, we augment the data by using SMOTE [37],
SMOTE-TK [38], and SMOT-ENN [38] in the feature space. Finally, we normalize the
images based on their mean and standard deviation in the training feature set. In addi-
tion, we implement the data augmentation by a rotation range from 0 to 10, with a width
shift and a height shift of both 0.2 for each training image.

Lung field segmentation

The key step in CAD scheme for lung disease detection in chest radiographs is the seg-
mentation of the lung field whereby many approaches have been proposed. All images in
the utilized datasets contain regions outside lungs that are not relevant to lesion detec-
tion. To diminish the risk that the features present in the images, but are irrelevant to
lesion detection tends to distort the final results, we decide to segment the lung regions

Fig. 1 Workflow diagram of our proposed method

Wang et al. BioMed Eng OnLine (2018) 17:63 Page 6 of 19

of the CXRs. The most classic method is a graph cut based segmentation method that
is mentioned in [39]. It begins with content based image retrieval using a training set
along with its defined masks. The initial specific anatomical model is created using
SIFT-flow for deformable registration of training masks for the patient CXR. Finally, a
graph cuts optimization procedure with a custom energy function is used. These meth-
ods can be broadly classified into four categories: (1) rule-based techniques, (2) pixel
classification technique, (3) deformable model-based techniques, and 4) hybrid tech-
niques [39]. Although these methods yield accurate segmentation based on pixel clas-
sification, they can result in non-desirable shapes, such as in [40, 41]. Recently, there
has been an increasing interest towards the exploration of deep learning methods. There
are many works on image segmentation employing these method such as references [36,
42–44]. The encoder-decoder structure of U-net can capture the delicate boundaries of
objects by exploiting the high resolution feature maps in the decoder path. Hence, we
can employ the U-net to account for the high variability of lung shapes with the JSRT
and MC. Figure 2 shows examples of the final boundaries of the lung areas with high IoU
0.963 and Dice 0.978. After the segmentation of the lung region, the resulting image is
cropped to the size of a minimum bounding box containing all the pixels of the lungs.

Fig. 2 Segmentation of lung from public database based on U-net

Wang et al. BioMed Eng OnLine (2018) 17:63 Page 7 of 19

Feature extraction
Feature extraction is another critical component in the CAD systems, which has a great
influence on their performance. Recently, deep neural network has gained popular-
ity because of its ability to learn mid and high-level image features. The deep learning
method has been applied in many medical image analysis tasks [10, 45, 46] and remark-
able results have been achieved. Deep learning methods are more powerful when
applied to large training sets. Meanwhile, we can ideally train a CNNs by a large number
of medical datasets and achieve promising performance. However, the large datasets in
the medical field are usually rare and very expensive to obtain. That is to say, we do not
have enough medical data, and deep learning methods are most effective when applied
to networks with large number of training data in order to train the deep neural net-
work. However, we can extract the high level features by using deep learning method
based on non-medical learning [47]. Recently, many papers have been published in the
general computer vision literature using transfer learning, which is an efficient way to
utilize image representations learned with CNNs on large-scale annotated datasets. In
particular, these are the target domains in which limited data exists. Transfer learning
from pre-trained ImageNet can be useful in medical images [7, 35].
In the computer vision domain, large image sets exist (e.g. ImageNet) which enable
better training of popular CNNs. In many image recognition tasks pertaining to the
large scale visual recognition challenge of ImageNet, a few examples of such CNNs are:
Decaf (2014), AlexNet (2012), VGG (2014), Overfeat (2013), Google Inception Net
(2015), and ResNet (2016) [48–54]. The CNNs were able to extract improved representa-
tions from raw data without requirement for domain knowledge. This was done with no
hyper-parameter tuning, which suggests that there are further improvements that can
be made. This is important for the task generally as it can mean that there is potential in
using CNNs or other deep learning strategies as a “black box”, whereby we will be able to
achieve excellent machine learning performance neglecting the need of expert-designed
feature extraction or domain knowledge.
The VGGNet in [50] was trained over a subset of images from ImageNet containing
1000 categories and 1.2 million images. This network is characterized by its simplicity,
as it involves using only 3 × 3 convolutional layers stacked on top of each other with
increasing depth. Reducing volume size is handled by max pooling. Two fully-connected
layers, each with 4096 nodes is then followed by a softmax classifier. For the fully-con-
nected layers, each has 4096 neurons. We extracted the feature from the fully-connected
layer on the pre-trained VGGNet model by the transfer learning to train our classifier.
The dataset can be benefited from more complex GoogLeNet [51], which is arguably the
current state-of-the-art CNN architectures. However, there are still other classical deep
neural networks such as InceptionNet [48] and residual networks [49].
In this paper, we extract the feature using transfer learning from VGGNet16. After
that, training the traditional classifier is done using a 10-fold cross validation. Besides,
we also design the neural network to fine-tune it from the extracted features.

CSDAE
An autoencoder (AE) neural network is an unsupervised learning algorithm that
applies backpropagation, and setting the target values to be equal to the inputs. In
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 8 of 19

other words, it is trying to learn an approximation to the identity function so that the
output can be similar to the input. The AE mostly aims at reducing feature space in
order to distill the essential aspects of the data versus more conventional deep learn-
ing, which expands the feature space significantly in order to capture non-linearity
and subtle interactions within the data. Autoencoder can also be seen as a non-linear
alternative to principal component analysis. This trivial function seems not to be very
exciting at all; however, if we consider some constraints on the AE, one can discover
suitable features for a learning problem in an automatic way. The goal of the AE is
to learn a latent or compressed representation of the input data by minimizing the
reconstruction error between the input at the encoding layer and its reconstruction at
the decoding layer.
The AE comprises two parts: the encoder and decoder. The encoder reduces the
dimensions of input data so that the original image is compressed. The decoder
restores the original image from the compressed data. The autoencoder is a neu-
ral network that learns to encode and decode automatically, which can be shown in
Fig. 3.
Beyond simply learning features by AE, there is a need for reinforcing the sparsity of
weights and increasing its robustness to noise. Ng et al. introduced the sparse autoen-
coder (SAE) [55], which is a variant of the AE. Sparsity is a useful constraint when
the number of hidden units is large. SAE has very few neurons that are active. Sparse
feature learning is a common method for compressed feature extraction in shallow
encoder-decoder-based networks, i.e. in sparse coding [56–59], in AE [60], and in
Restricted Boltzmann Machines (RBM) [61, 62]. There is another variant of AE called
the denoising autoencoder (DAE) [63], which minimizes the error in reconstructing
the input from a stochastically corrupted transformation of the input. The stochas-
tic corruption process involves randomly setting some inputs to zero. The purpose
of this denoising autoencoder is to take a noisy image as input and return a clean
image as output. In our research, we consider CSDAE to train a convolutional sparse
AE, which can reconstruct the input data from a corrupted version by manual addi-
tion with random noise (Table 2). Based on our case, Gaussian noise is added to the
original image. This approach can effectively integrate the advantages in SAE, DAE,
and CNN. This hybrid structure forces our model to learn more abstract and noise-
resistant features, which will help to improve the model’s representation learning per-
formance. We reconstruct the original dataset using the reduced set of features and
compute the means squared error for both of them (Fig. 4).
The identity function seems particularly trivial to be learned. However, by placing con-
straints on the network via limiting the number of hidden units, we can discover inter-
esting structures pertaining to the data. In this paper, we employ convolutional sparse

Fig. 3 Flow chart of Autoencoder

Wang et al. BioMed Eng OnLine (2018) 17:63 Page 9 of 19

Table 2 Network architecture of CSDAE

Layer (type) Output shape Param#

Input_1(InputLayer) (None, 512, 512, 1) 0

Conv2d_1(Conv2D) (None, 512, 512, 16) 160
Max_pooling2d_1(MaxPooling2D) (None, 256, 256, 16) 0
Conv2d_2(Conv2D) (None, 256, 256, 8) 1160
Max_pooling2d_2(MaxPooling2D) (None, 128, 128, 8) 0
Conv2d_3(Conv2D) (None, 128, 128, 8) 584
Max_pooling2d_3(MaxPooling2D) (None, 64, 64, 8) 0
Conv2d_4(Conv2D) (None, 64, 64, 8) 584
Up_sampling2d_1(UpSampling2D) (None, 128, 128, 8) 0
Conv2d_5(Conv2D) (None, 128, 128, 8) 584
Up_sampling2d_2(UpSampling2D) (None, 256, 256, 8) 0
Conv2d_6(Conv2D) (None, 256, 256, 8) 1168
Up_sampling2d_3(UpSampling2D) (None, 512, 512, 16) 0
Conv2d_7(Conv2D) (None, 512, 512, 1) 145

Fig. 4 Loss over the epochs on the AE and CSDAE. a AE with noise factor 0.01, b CSDAE with noise factor
0.01, c AE with noise factor 0.05, and d CSDAE with noise factor 0.05

autoencoder in order to reconstruct the original image. The power of CSDAE lies in the
form of reconstruction-oriented training, where the hidden units can conserve the effi-
cient feature to represent the input data. Feature extractors are learned by minimizing
the reconstruction error of the cost function in [1]. The first term in the cost function is
the error term. The second term is a regularization term that is given by
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 10 of 19

M
1 i
L(X, Y ) = x − yi + W 2 , (1)
2 2
i=1

where X and Y represent the training and reconstructed data while and W are the reg-
ularization parameter and weight, respectively.
In order to obtain a better representation, we consider a rectified linear unit (ReLU)
activation function, and the default hyperparameters settings are as follows: learning
rate = 0.0001, and batch size = 62. These hyperparameters are chosen as those previously
optimized on ImageNet. We set the L1 regularization parameter as equal to 0.00001 in
order to determine the sparseness value. The Gaussian noise factors are set to 0.01 and
0.05.
The key idea of CSDAE is to learn a sparse but robust bank of local features. After that,
we can compute the reconstruction error by performing an input image subtraction of
the reconstructed image from the CSDAE network.
Totally, a 2480 images from the four public databases are used for the experiments,
which are randomly split into 70% training (1736 images) and 30% testing (744 images).
Only 1318 images were used for training the CSDAE.

Image classification with CNN

Even when we balance the dataset by augmenting many diseased samples, it is still dif-
ficult for CNN to learn a good model to distinguish many abnormal cases from normal
cases, which have many variations on their original samples. Therefore, we attain the
features from VGGNet16 with pre-training on ImageNet to fine-tune our CNNs with
batch-normalization, data-dropout, which is assessed by the cross entropy and focal loss
[64]. Default hyperparameter settings are as follows: learning rate = 3e−5, regularization
is L1, batch size = 62, and drop out = 0.25 or 0.5. These parameters are chosen as opti-
mized on training data. Table 3 illustrates the network architecture. For the focal loss
function, we set lambda = 0.1, and gamma = 0.1. In addition, we train our network on
the original training data and other three data augmentation.

Table 3 Network architecture of proposed image classification

Layer (type) Output shape Param#

Input_4(InputLayer) (None, 512) 0

Batch_normalization_4(Batch) (None, 512) 2048
Dropout_4(Dropout) (None, 512) 0
Dense_4(Dense) (None, 512) 262,656
Batch_normalization_5(Batch) (None, 512) 2048
Dropout_5(Dropout) (None, 512) 0
Dense_5(Dense) (None, 512) 262,656
Batch_normalization_6(Batch) (None, 512) 2048
Dropout_6(Dropout) (None, 512) 0
Dense_6(Dense) (None, 1) 513
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 11 of 19

Combining predictors for classification based on ROC curve

We can attain two metrics from the prediction of CNN and the reconstruction error from
the CSDAE. How to combine multiple predictors into a score is the key to the final deci-
sion. We assume that the predictors (C1 , C2 , . . . CP ) are given and the statistical problem is to
estimate β = (β2 , . . . , βP ) from data in Eq. (2). We seek estimators that are consistent under
the risk score model [3], since under that model fβ (C) is the optimal combination as follows:

fβ (C) = C1 + β2 C2 + · · · βP CP . (2)

Under a particular circumstance we can allow fβ (C) to be the “right” combination score
for classification based on C. If the risk score is some monotone increasing function of
fβ (C) , then we have

P[y = 1|C] = g(C1 + β2 C2 + · · · βP CP ) = g fβ (C) . (3)

It follows the Neyman-Pearson lemma [65, 66] that rules based on fβ (C) > c are opti-
mal. Assuming only the generalized linear model [3], the optimality of fβ (C) implies that
the ROC curve for any other function of C cannot be higher at any point than the ROC
curve for fβ (C) . The area under the ROC curve (AUC) is the most popular ROC sum-
mary index. The optimal ROC curve has maximum AUC so we can use it as the basis
for an objective function of the data to estimate β . It is easy to show that the AUC of the
empirical ROC curve is the Mann–Whitney U statistic as follows:
nD nD̄
I Lb (CDi ) > Lb (CD̄i )

AUC(b) = i=1 i=1
. (4)
nD nD̄

where nD is the number of positive cases and nD̄ is the number of negative cases. We pre-
sent the corresponding AUC based estimator of β as

βAUC = argmax AUC(b) . (5)

Results
All experiments are conducted on HP Z840 platform with the Tesla K40c and Quadro
K5200, CPU E5-2650 v3 2.30 GHz, memory 126G, and Ubuntu 16.04 operating system.
In total, 2480 images from the four public databases are used in our experiments,
which are randomly split into 70% training (1736 images) including 1318 negative cases
and 418 positive cases, and 30% testing (744 images) including 565 negative cases and
179 positive cases. The testing data is strictly independent to the training data, which
is not used to tune our algorithm. We employ four commonly used metrics to quanti-
tatively evaluate the performance of our method namely, precision, recall, F1, and AUC
scores.
Through experiments, we are able to show that these factors elevate the classification
accuracy of our CSDAE. They are all indispensable to our model as there are usually a
small drop in accuracy when removing these structures. We design a CSDAE network
by adding different noise factors of 0.01 and 0.05 for all data. Then, the CSDAE network
and the regular AE network are trained and tested on the datasets. The mean and vari-
ance metrics of the MSE are used for evaluation, respectively. The experimental results
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 12 of 19

Table 4 Comparisons of the mean and std of MSE between AE and CSDAE

Noise factor Train data MSE (mean ± std) Test data MSE (mean ± std) Test noised data
MSE (mean ± std)

AE 0.01 0.000983 ± 0.000304 0.001088 ± 0.0004 0.001112 ± 0.000405

CSDAE 0.01 0.000829 ± 0.000282 0.000923 ± 0.000369 0.000922 ± 0.000368
AE 0.05 0.001039 ± 0.000334 0.001146 ± 0.000427 0.001255 ± 0.000445
CSDAE 0.05 0.001003 ± 0.000334 0.001103 ± 0.000421 0.001174 ± 0.00045

Fig. 5 Original lung image and the reconstructed lung image from AE: upper row is the original image while
the bottom row is the reconstruction image

Fig. 6 Original lung image and reconstructed lung image from CSDAE. a Results of CSDAE with noise
factor 0.01: the upper row is the original image, middle row is the image with noise, and bottom row is the
reconstructed image. b Results of CSDAE with noise factor 0.05: the upper row is the original image, middle
row is the image with noise, and bottom row is the reconstructed image

show that our CSDAE network is better than the conventional AE network under differ-
ent noise levels, which illustrate the reliability of our network design. The detailed com-
parisons are shown in Table 4. The training process of AE and CSDAE based on different
noise factors is presented in Fig. 5. The original lung image and the reconstructed lung
image from AE are shown in Fig. 6. Meanwhile, Fig. 7 shows the original lung image and
the reconstructed lung image from CSDAE.
Tables 5 and 6 provide the comparison of different classifiers based on four data-
sets: (1) without data augmentation, (2) smote augmentation (SMOTE), (3) only data
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 13 of 19

Fig. 7 Precision for different threshold in CSDAE (a) with noise factor 0.01 and (b) noise factor 0.05

Table 5 Performance results based on test data using four classifiers: KNN, logistic
regression, SVM, random forest
Classifier Data augmentation Precision Recall F1 AUC

KNN Without augmentation 0.73 0.58 0.64 0.753

0.87 0.93 0.9
SMOTE 0.44 0.84 0.58 0.752
0.93 0.67 0.78
Positive augmentation 0.57 0.68 0.62 0.758
0.89 0.68 0.87
4× augmentation 0.57 0.84 0.62 0.758
0.89 0.89 0.87
Logistic Without augmentation 0.76 0.54 0.63 0.744
0.87 0.95 0.91
SMOTE 0.61 0.82 0.64 0.79
0.92 0.76 0.84
Positive augmentation 0.57 0.66 0.61 0.754
0.89 0.84 0.86
4× augmentation 0.63 0.72 0.67 0.792
0.91 0.87 0.89
SVM Without augmentation 0.63 0.73 0.68 0.798
0.91 0.86 0.89
SMOTE 0.63 0.77 0.69 0.813
0.92 0.86 0.89
Positive augmentation 0.59 0.64 0.61 0.75
0.88 0.86 0.87
4× augmentation 0.61 0.71 0.66 0.784
0.9 0.86 0.88
Random forest Without augmentation 0.69 0.38 0.49 0.663
0.83 0.95 0.88
SMOTE 0.62 0.5 0.56 0.704
0.85 0.9 0.88
Positive augmentation 0.54 0.63 0.58 0.73
0.88 0.83 0.85
4× augmentation 0.59 0.54 0.56 0.71
0.86 0.88 0.87
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 14 of 19

Table 6 Comparison of accuracy, recall, F1 score, and AUC of the methods on test data
by the deep network based on four data augmentation methods
Loss function Data augmentation Precision Recall F1 AUC

Cross entropy Without augmentation 0.76 0.72 0.74 0.821

0.91 0.93 0.92
SMOTE 0.58 0.82 0.68 0.813
0.93 0.81 0.87
Positive augmentation 0.67 0.75 0.81 0.815
0.92 0.88 0.90
4× augmentation 0.69 0.77 0.72 0.827
0.92 0.89 0.91
Focal loss Without augmentation 0.76 0.73 0.74 0.828
0.91 0.93 0.92
SMOTE 0.52 0.82 0.64 0.79
0.93 0.76 0.84
Positive augmentation 0.74 0.72 0.73 0.817
0.91 0.92 0.91
4× augmentation 0.7 0.74 0.72 0.821
0.9 0.9 0.91

Table 7 Results of the performance of our network method

augmentation for positive data in training data (positive augmentation), and (4) data
augmentation 4 times for training data (4× augmentation). Table 6 shows that the
respective performance by CNN with different loss function and data augmentation
methods. It demonstrates that the focal loss is useful for the imbalanced dataset com-
pared to when using the cross-entropy loss in Table 6. From the result of Table 5, it
shows that CNN classifier has superior performance when compared to the tradi-
tional classifier.
Nonetheless, the network is able to attain high AUC 0.821 with an accuracy 0.7,
recall 0.74 and F1 score of 0.72 on the abnormal data, 0.9, recall 0.9, and F1 score of
0.91 on the normal data for testing data, respectively, with the following confusion
matrix given in Table 7.
We can notice the changes in precision and recall for different threshold values
on reconstruction error from the CSDAE with noise factor 0.01 and 0.05 as shown
in Figs. 7, 8. From Table 4, we see that the MSE of reconstruction error with mean
0.0009, std 0.00037 on the CSDAE with noise factor 0.01, as well as the MSE of recon-
struction error with mean 0.00117 and std 0.00045 with noise factor 0.05. Hence, our
CSDAE is a well- trained autoencoder for the normal case. We can screen the abnor-
mal case from the CSDAE if its reconstruction error is above specific threshold value.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 15 of 19

Fig. 8 Recalls for different thresholds in CSDAE (a) with noise factor 0.01 and (b) noise factor 0.05

Table 8 The results of the performance of our method with max AUC

Now, two metrics are given for the final decision, and we can combine these two
metrics to one score using the max area under the ROC curve (AUC).

βAUC = argmax
AUC(b) ,
0 < T < Tmean (6)
0.5 < P < 1

where T is the threshold, Tmean is the average construction error of CSDAE, and P is the
predicted probability from our CNNs. The confusion matrix of the performance of our
method with max AUC for testing data is given in Table 8.
Finally, we can compute the three levels pertaining to the image by voting on these
different classifiers. For the testing data, we can obtain 395 normal cases, 88 abnormal
cases, and 261 uncertainty case. Comparing with the ground truth, we can achieve
precision of 98.7% (390/395) on normal cases and 94.3% (83/88) on abnormal cases
for testing data. It is good to separate these images into three different levels with
a total precision of 97.9% based on the normal and abnormal status levels. Achiev-
ing 100% of accuracy in CXR abnormality classification, to the best of our knowl-
edge, does not exist as there is always possibility for false positive rate. We propose
a method that has high detection rate and low false positive. In order to reduce false
positives, this article proposes a variety of indicators and optimizes these indicators
based on maximizing AUC. The results of this paper show that, the accuracy rate in
our sample classified as normal is 98.7%, and the accuracy of classification as abnor-
mality is 94.3%. In this way, we can divide the entire data set into three categories.
This allows senior doctors to focus on high-risk suspected abnormal chest radio-
graphs, mid-level physicians focus on uncertain categories, and lower-level medical
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 16 of 19

experts can pay more attention to the suspected normal samples. Eventually, these
experts can spend more time focusing on the high-risk chest radiographs. On the one
hand, this greatly reduces the workload of the expert team, and correcting potential
misdiagnosis. On the other hand, it can enable patients to receive timely medical
treatment as a result of the time-to-diagnosis that is saved.

Discussion
Many methods have been proposed to perform CXRs classification task, such as [21, 34].
Shin et al. [34, 35] used a CNN to detect specific diseases in CXRs images and achieved
precision of 0.776 and recall of 0.76 based on the normal case, and then precision 0.556,
recall 0.577 based on the nodule. Tataru et al. [21] attempted to classify a chest X-ray
as normal versus abnormal in order to assist primary care physicians and radiologists
to move more quickly and efficiently rather than render radiology obsolete with accu-
racy of 0.8 and F1 score of 0.66. However, our method yields an accuracy of 0.7, recall
of 0.74, and F1 score of 0.72 on the abnormal data, while it achieves an accuracy of 0.9,
recall of 0.9, and F1 score of 0.91 on the normal data. Yet, we cannot entirely compare
these methods because of the different classification tasks and the databases. We only
deal with the grade diagnosis for physical examination. The contribution of this paper is
based on the fact that it proposes multiple new metric indices. Combining predictors for
classification using area under the ROC curve is the proposed solution for this task. We
find that, the final decision is related to the threshold from reconstruction error and the
probability value.

Conclusion
In this paper, we present an effective framework that learns and detects diseases from
the patients’ CXRs based on the four public datasets. Furthermore, we introduce an
approach to classify image levels by summarizing the hyper classifiers outputs and
reconstruction error. Different metrics are used in this paper to classify the image lev-
els. We combine the multiple classifiers using AUC in order to guarantee high confi-
dence. Not only can this computer aided triaging system classify and detect disease in
images, but also it can even compute the different image levels with promising results.
Compared to existing methods, our method yields high accuracy, recall, and F1 score for
the abnormal and normal datasets. Note that, in this research, our preliminary results
cannot justify our proposed method to be fully adopted for an entirely automated chest
screen system in clinical practice. However, this technique can partially help in the clas-
sification of normal versus abnormal CXRs and provide the physicians and radiologists
with valuable information to significantly decrease time-to-diagnosis. To increase the
performance of our method, we plan to build a big dataset for training intermediary
deep network from clinical data in our future work. In addition, implementing an end-
to-end learning model can be promising and may achieve high performance via opti-
mally determined parameters.
Authors’ contributions
CW conceived and designed the experiments. CW and AE prepared the manuscript and analyzed the results. QH, FJ, JW,
and AE discussed the results and gave good suggestions. QH supervised all stages of the project and revised the manu-
script. All authors read and approved the final manuscript.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 17 of 19

Author details
1
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, 1068 Xueyuan Boulevard, Shenz-
hen 518055, China. 2 University of Chinese Academy of Sciences, 52 Sanlihe Road, Beijing 100864, China. 3 Guangdong
Key Laboratory for Biomedical Measurements and Ultrasound Imaging, School of Biomedical Engineering, Health
Science Center, Shenzhen University, Shenzhen 518060, China. 4 Department of Computer Science, Misr Higher Institute
for Commerce and Computers, Mansoura 35516, Egypt. 5 Key Laboratory of Human-Machine Intelligence Synergy Sys-
tems, 1068 Xueyuan Boulevard, Shenzhen 518055, China.

Acknowledgements
None.

Competing interests
The authors declare that they have no competing interests.

Availability of data and materials

All datasets used in this study are publicly available as indicated in "Datasets" section. Algorithmic implementations
generated and analyzed during this study are available from the corresponding author(s) on reasonable request.

Consent for publication

All authors consent to the publication of this work.

Ethics approval and consent to participate

Not applicable.

Funding
This work has been supported by: Joint Key project of NSFC-Shenzhen Robot Foundation Research Center (No.
U1713215), Shenzhen Key Basic Research Grant (No. JCYJ20160331191401141), and National Natural Science Foundation
of China (No. 61671440).

Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Received: 4 April 2018 Accepted: 9 May 2018

References
1. Jackson WL. In radiology, turnaround time is king. Practice management. 2015.
2. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L, editors. Imagenet: a large-scale hierarchical image database. In: 2009
IEEE conference on Computer vision and pattern recognition. Piscataway: IEEE; 2009
3. Schmidhuber J. Deep learning in neural networks: an overview. Neural Netw. 2015;61:85–117.
4. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436.
5. Wang D, Khosla A, Gargeya R, Irshad H, Beck AH. Deep learning for identifying metastatic breast cancer. arXiv pre-
print arXiv:160605718. 2016.
6. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, et al. Dermatologist-level classification of skin cancer with
deep neural networks. Nature. 2017;542(7639):115.
7. Bar Y, Diamant I, Wolf L, Lieberman S, Konen E, Greenspan H, editors. Chest pathology detection using deep learning
with non-medical training. In: IEEE 12th international symposium on biomedical imaging (ISBI). Piscataway: IEEE;
2015.
8. Ciompi F, de Hoop B, van Riel SJ, Chung K, Scholten ET, Oudkerk M, et al. Automatic classification of pulmonary
peri-fissural nodules in computed tomography using an ensemble of 2D views and a convolutional neural network
out-of-the-box. Med Image Anal. 2015;26(1):195–202.
9. van Ginneken B, Setio AA, Jacobs C, Ciompi F, editors. Off-the-shelf convolutional neural network features for
pulmonary nodule detection in computed tomography scans. In: IEEE 12th international symposium on biomedical
imaging (ISBI). Piscataway: IEEE; 2015.
10. Shen W, Zhou M, Yang F, Yang C, Tian J, editors. Multi-scale convolutional neural networks for lung nodule classifica-
tion. In: International conference on information processing in medical imaging. Berlin: Springer; 2015.
11. Chen S, Qin J, Ji X, Lei B, Wang T, Ni D, et al. Automatic scoring of multiple semantic attributes with multi-task feature
leverage: a study on pulmonary nodules in CT images. IEEE Trans Med Imaging. 2017;36(3):802–14.
12. Ciompi F, Chung K, Van Riel SJ, Setio AAA, Gerke PK, Jacobs C, et al. Towards automatic pulmonary nodule manage-
ment in lung cancer screening with deep learning. Sci Rep. 2017;7:46479.
13. Dou Q, Chen H, Yu L, Qin J, Heng P-A. Multilevel contextual 3-d cnns for false positive reduction in pulmonary nod-
ule detection. IEEE Trans Biomed Eng. 2017;64(7):1558–67.
14. Li W, Cao P, Zhao D, Wang J. Pulmonary nodule classification with deep convolutional neural networks on com-
puted tomography images. Comput Math Methods Med. 2016;2016:6215085. https://doi.org/10.1155/2016/62150
85.
15. Setio AAA, Ciompi F, Litjens G, Gerke P, Jacobs C, van Riel SJ, et al. Pulmonary nodule detection in CT images: false
positive reduction using multi-view convolutional networks. IEEE Trans Med Imaging. 2016;35(5):1160–9.
16. Shen W, Zhou M, Yang F, Dong D, Yang C, Zang Y, et al. editors. Learning from experts: developing transferable deep
features for patient-level lung cancer prediction. In: International conference on medical image computing and
computer-assisted intervention. Berlin: Springer; 2016.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 18 of 19

17. Sun W, Zheng B, Qian W. Computer aided lung cancer diagnosis with deep learning algorithms. In: Sun W, Zheng B,
editors. Medical imaging 2016: computer-aided diagnosis. Bellingham: International Society for Optics and Photon-
ics; 2016.
18. Teramoto A, Fujita H, Yamamuro O, Tamaki T. Automated detection of pulmonary nodules in PET/CT images: ensem-
ble false-positive reduction using a convolutional neural network technique. Med phys. 2016;43(6Part 1):2821–7.
19. Rajkomar A, Lingam S, Taylor AG, Blum M, Mongan J. High-throughput classification of radiographs using deep
convolutional neural networks. J Digit Imaging. 2017;30(1):95–101.
20. van Ginneken B, Schaefer-Prokop CM, Prokop M. Computer-aided diagnosis: how to move from the laboratory to
the clinic. Radiology. 2011;261(3):719–32.
21. Tataru C, Yi D, Shenoyas A, Ma A. Deep Learning for abnormality detection in chest X-ray images. 2017. http://cs231
n.stanford.edu/reports/2017/pdfs/527.pdf (unpublished).
22. Fernandes SL, Gurupur VP, Lin H, Martis RJ. A Novel fusion approach for early lung cancer detection using computer
aided diagnosis techniques. J Med Imaging Health Inform. 2017;7(8):1841–50.
23. Jia T, Zhang H, Bai Y. Benign and malignant lung nodule classification based on deep learning feature. J Med Imag-
ing Health Inform. 2015;5(8):1936–40.
24. Lan T, Chen S, Li Y, Ding Y, Qin Z, Wang X. Lung nodule detection based on the combination of morphometric and
texture features. J Med Imaging Health Inform. 2018;8(3):464–71.
25. Keming M, Zhuofu D. Lung nodule image classification based on ensemble machine learning. J Med Imaging
Health Inform. 2016;6(7):1679–85.
26. Yao L, Poblenz E, Dagunts D, Covington B, Bernard D, Lyman K. Learning to diagnose from scratch by exploiting
dependencies among labels. arXiv preprint arXiv:171010501. 2017.
27. Wang X, Peng Y, Lu L, Lu Z, Bagheri M, Summers RM, editors. Chestx-ray8: Hospital-scale chest x-ray database and
benchmarks on weakly-supervised classification and localization of common thorax diseases. In: 2017 IEEE confer-
ence on computer vision and pattern recognition (CVPR). Piscataway: IEEE; 2017.
28. Huang G, Liu Z, Weinberger KQ, van der Maaten L, editors. Densely connected convolutional networks. In: Proceed-
ings of the IEEE conference on computer vision and pattern recognition. 2017.
29. Hochreiter S, Schmidhuber J. Long short-term memory. Neural Comput. 1997;9(8):1735–80.
30. Kumar P, Grewal M, Srivastava MM. Boosted cascaded convnets for multilabel classification of thoracic diseases in
chest radiographs. arXiv preprint arXiv:171108760. 2017.
31. Rajpurkar P, Irvin J, Zhu K, Yang B, Mehta H, Duan T, et al. CheXNet: Radiologist-level pneumonia detection on chest
x-rays with deep learning. arXiv preprint arXiv:171105225. 2017.
32. Shiraishi J, Katsuragawa S, Ikezoe J, Matsumoto T, Kobayashi T, K-i Komatsu, et al. Development of a digital image
database for chest radiographs with and without a lung nodule: receiver operating characteristic analysis of radiolo-
gists’ detection of pulmonary nodules. Am J Roentgenol. 2000;174(1):71–4.
33. Jaeger S, Candemir S, Antani S, Wáng Y-XJ, Lu P-X, Thoma G. Two public chest X-ray datasets for computer-aided
screening of pulmonary diseases. Quant Imaging Med Surg. 2014;4(6):475.
34. Shin H-C, Roberts K, Lu L, Demner-Fushman D, Yao J, Summers RM, editors. Learning to read chest X-rays: recurrent
neural cascade model for automated image annotation. In: Proceedings of the IEEE conference on computer vision
and pattern recognition. 2016.
35. Shin H-C, Roth HR, Gao M, Lu L, Xu Z, Nogues I, et al. Deep convolutional neural networks for computer-aided detec-
tion: CNN architectures, dataset characteristics and transfer learning. IEEE Trans Med Imaging. 2016;35(5):1285–98.
36. Ronneberger O, Fischer P, Brox T, editors. U-net: convolutional networks for biomedical image segmentation. In:
International conference on medical image computing and computer-assisted intervention. Berlin: Springer; 2015.
37. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell
Res. 2002;16:321–57.
38. Batista GE, Prati RC, Monard MC. A study of the behavior of several methods for balancing machine learning training
data. ACM SIGKDD Explor Newsl. 2004;6(1):20–9.
39. Candemir S, Jaeger S, Palaniappan K, Musco JP, Singh RK, Xue Z, et al. Lung segmentation in chest radiographs using
anatomical atlases with nonrigid registration. IEEE Trans Med Imaging. 2014;33(2):577–90.
40. Li X, Luo S, Hu Q, Li J, Wang D, Chiong F. Automatic lung field segmentation in X-ray radiographs using statistical
shape and appearance models. J Med Imaging Health Inform. 2016;6(2):338–48.
41. Wu G, Zhang X, Luo S, Hu Q. Lung segmentation based on customized active shape model from digital radiography
chest images. J Med Imaging Health Inform. 2015;5(2):184–91.
42. Badrinarayanan V, Kendall A, Cipolla R. Segnet: a deep convolutional encoder–decoder architecture for image
segmentation. IEEE Trans Pattern Anal Mach Intell. 2017;39(12):2481–95.
43. Long J, Shelhamer E, Darrell T, editors. Fully convolutional networks for semantic segmentation. In: Proceedings of
the IEEE conference on computer vision and pattern recognition. 2015.
44. Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL. Deeplab: semantic image segmentation with deep convo-
lutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell. 2018;40(4):834–48.
45. Suk H-I, Lee S-W, Shen D. Initiative AsDN. Deep sparse multi-task learning for feature selection in Alzheimer’s disease
diagnosis. Brain Struct Funct. 2016;221(5):2569–87.
46. Wang S, Kim M, Wu G, Shen D. Scalable high performance image registration framework by unsupervised deep
feature representations learning. Deep learning for medical image analysis. New York: Elsevier; 2017. p. 245–69.
47. Bar Y, Diamant I, Wolf L, Greenspan H. Deep learning with non-medical training used for chest pathology identifica-
tion. In: Bar Y, Diamant I, Wolf L, editors. Medical imaging 2015: computer-aided diagnosis 2015. Bellingham: Interna-
tional Society for Optics and Photonics; 2015.
48. Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z, editors. Rethinking the inception architecture for computer
vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
49. He K, Zhang X, Ren S, Sun J, editors. Deep residual learning for image recognition. In: Proceedings of the IEEE confer-
ence on computer vision and pattern recognition. 2016.
Wang et al. BioMed Eng OnLine (2018) 17:63 Page 19 of 19

50. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv
:14091556. 2014.
51. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, et al., editors. Going deeper with convolutions 2015: Cvpr.
52. Donahue J, Jia Y, Vinyals O, Hoffman J, Zhang N, Tzeng E, et al. editors. Decaf: a deep convolutional activation feature
for generic visual recognition. In: International conference on machine learning. 2014.
53. Krizhevsky A, Sutskever I, Hinton GE, editors. Imagenet classification with deep convolutional neural networks. In:
Advances in neural information processing systems. 2012.
54. Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y. Overfeat: Integrated recognition, localization and
detection using convolutional networks. arXiv preprint arXiv:13126229. 2013.
55. Ng A, Autoencoder S. CS294A Lecture notes. Dosegljivo. 2011. https://web.stanford.edu/class/cs294a/sparseAuto
encoder_2011new.pdf. Accessed 20 Jul 2016.
56. Olshausen BA, Field DJ. Sparse coding with an over complete basis set: a strategy employed by V1? Vision Res.
1997;37(23):3311–25.
57. Hoyer PO, editor. Non-negative sparse coding. Neural networks for signal processing. In: Proceedings of the 2002
12th IEEE Workshop. ‎Piscataway: IEEE; 2002.
58. Hoyer PO. Non-negative matrix factorization with sparseness constraints. J Mach Learn Res. 2004;5:1457–69.
59. Olshausen BA, Field DJ. Emergence of simple-cell receptive field properties by learning a sparse code for natural
images. Nature. 1996;381(6583):607.
60. Ngiam J, Chen Z, Bhaskar SA, Koh PW, Ng AY, editors. Sparse filtering. In: Advances in neural information processing
systems. 2011.
61. Poultney C, Chopra S, Cun YL, editors. Efficient learning of sparse representations with an energy-based model. In:
Advances in neural information processing systems. 2007.
62. Boureau Y-l, Cun YL, editors. Sparse feature learning for deep belief networks. In: Advances in neural information
processing systems. 2008.
63. Vincent P, Larochelle H, Bengio Y, Manzagol P-A, editors. Extracting and composing robust features with denoising
autoencoders. In: Proceedings of the 25th international conference on Machine learning. Helsinki: ACM; 2008.
64. Lin T-Y, Goyal P, Girshick R, He K, Dollár P. Focal loss for dense object detection. arXiv preprint arXiv:170802002. 2017.
65. Neyman J, Pearson E. On the problem of the most efficient tests of statistical inference. Biometrika A.
1933;20:175–240.
66. Pepe MS, Cai T, Longton G. Combining predictors for classification using the area under the receiver operating
characteristic curve. Biometrics. 2006;62(1):221–9.

Ready to submit your research ? Choose BMC and benefit from:

• fast, convenient online submission

• thorough peer review by experienced researchers in your field
• rapid publication on acceptance
• support for research data, including large and complex data types
• gold Open Access which fosters wider collaboration and increased citations
• maximum visibility for your research: over 100M website views per year

At BMC, research is always in progress.

Learn more biomedcentral.com/submissions

One Class Classification-based Quality Assurance of Organs-at-risk Delineation in Radiotherapy
No ratings yet
One Class Classification-based Quality Assurance of Organs-at-risk Delineation in Radiotherapy
9 pages
Computer Methods and Programs in Biomedicine
No ratings yet
Computer Methods and Programs in Biomedicine
12 pages
12885_2021_Article_8773
No ratings yet
12885_2021_Article_8773
10 pages
Fast_and_Efficient_Lung_Abnormality_Identification_With_Explainable_AI_A_Comprehensive_Framework_for_Chest_CT_Scan_and_X-Ray_Images
No ratings yet
Fast_and_Efficient_Lung_Abnormality_Identification_With_Explainable_AI_A_Comprehensive_Framework_for_Chest_CT_Scan_and_X-Ray_Images
19 pages
11.deep Learning Applications in Medical Image Analysis-Brain Tumor
No ratings yet
11.deep Learning Applications in Medical Image Analysis-Brain Tumor
4 pages
A Bilinear Convolutional Neural Network For Lung Nodules Classification On CT Images
No ratings yet
A Bilinear Convolutional Neural Network For Lung Nodules Classification On CT Images
11 pages
Literature Survey: Performance Comparision of Residual Deep Network For The Brain Tumor Detection
No ratings yet
Literature Survey: Performance Comparision of Residual Deep Network For The Brain Tumor Detection
19 pages
Lung sementation in X-Ray
No ratings yet
Lung sementation in X-Ray
12 pages
LungCancer DZK TAB
No ratings yet
LungCancer DZK TAB
6 pages
Rule-Based Outlier Detection of AI-generated Anatomy Segmentations
No ratings yet
Rule-Based Outlier Detection of AI-generated Anatomy Segmentations
15 pages
Anapub Paper Template
No ratings yet
Anapub Paper Template
10 pages
Paper
No ratings yet
Paper
25 pages
Tjcme 2011 Al
No ratings yet
Tjcme 2011 Al
7 pages
1 s2.0 S1746809424006372 Main
No ratings yet
1 s2.0 S1746809424006372 Main
12 pages
5
No ratings yet
5
6 pages
Lung Cancer Detection Using Machine Learning Algorithms and Neural Network On A Conducted Survey Dataset Lung Cancer Detection
No ratings yet
Lung Cancer Detection Using Machine Learning Algorithms and Neural Network On A Conducted Survey Dataset Lung Cancer Detection
4 pages
CNN Architecture Optimization Using Bio Inspired Algor - 2022 - Computers in Bio
No ratings yet
CNN Architecture Optimization Using Bio Inspired Algor - 2022 - Computers in Bio
13 pages
A Fully Integrated Computer-Aided Diagnosis System For Digital X-Raymammograms Via Deep Learning Detection, Segmentation, and Classification
No ratings yet
A Fully Integrated Computer-Aided Diagnosis System For Digital X-Raymammograms Via Deep Learning Detection, Segmentation, and Classification
11 pages
12 III March 2024
No ratings yet
12 III March 2024
9 pages
Cancer Detection Using Convolutional Neural Network: February 2021
No ratings yet
Cancer Detection Using Convolutional Neural Network: February 2021
10 pages
2D3D Clasification 01
No ratings yet
2D3D Clasification 01
11 pages
Computational Intelligence - 2019 - Mendoza - Detection and Classification of Lung Nodules in Chest X Ray Images Using Deep
No ratings yet
Computational Intelligence - 2019 - Mendoza - Detection and Classification of Lung Nodules in Chest X Ray Images Using Deep
32 pages
4 A deep learning-based cancer survival time classifier for small datasets
No ratings yet
4 A deep learning-based cancer survival time classifier for small datasets
9 pages
Lung segmentation_in_Chest_X-Ray
No ratings yet
Lung segmentation_in_Chest_X-Ray
11 pages
Ai TB
No ratings yet
Ai TB
12 pages
2022LNNSSpringer-LungCancerDetectionUsingDeepConvolutionalNeuralNetworks
No ratings yet
2022LNNSSpringer-LungCancerDetectionUsingDeepConvolutionalNeuralNetworks
14 pages
Lung Paper - Modified
No ratings yet
Lung Paper - Modified
12 pages
Lung Tumor Localization and Visualization in Chest
No ratings yet
Lung Tumor Localization and Visualization in Chest
17 pages
The feasibility and accuracy of holographic navigation with laser crosshair simulator registration on a mixed-reality display
No ratings yet
The feasibility and accuracy of holographic navigation with laser crosshair simulator registration on a mixed-reality display
31 pages
Hybrid Bio-Inspired Algorithm and Convolutional Neural Network For Automatic Lung Tumor Detection
No ratings yet
Hybrid Bio-Inspired Algorithm and Convolutional Neural Network For Automatic Lung Tumor Detection
14 pages
1-s2.0-S2590123024017006-main (1)
No ratings yet
1-s2.0-S2590123024017006-main (1)
17 pages
tumor size detection
No ratings yet
tumor size detection
19 pages
Sparrow_Search_Algorithm_With_Stacked_Deep_Learning_Based_Medical_Image_Analysis_for_Pancreatic_Cancer_Detection_and_Classification (1)
No ratings yet
Sparrow_Search_Algorithm_With_Stacked_Deep_Learning_Based_Medical_Image_Analysis_for_Pancreatic_Cancer_Detection_and_Classification (1)
9 pages
Sudha Multimedia
No ratings yet
Sudha Multimedia
25 pages
Pneumonia Lung Opacity Detection and Segmentation in Chest X-Rays by Using Transfer Learning of The Mask R-CNN
No ratings yet
Pneumonia Lung Opacity Detection and Segmentation in Chest X-Rays by Using Transfer Learning of The Mask R-CNN
9 pages
ResNet-50 Vs VGG-19 Vs Training From Scratch A Comparative Analysis of The Segmentation and Classification of Pneumonia From
No ratings yet
ResNet-50 Vs VGG-19 Vs Training From Scratch A Comparative Analysis of The Segmentation and Classification of Pneumonia From
10 pages
02 Pneumonia - Detection - From - Chest - X-Rays - Using - The - CH
No ratings yet
02 Pneumonia - Detection - From - Chest - X-Rays - Using - The - CH
12 pages
Intelligent_Pneumonia_Identification_From_Chest_X-Rays_A_Systematic_Literature_Review
No ratings yet
Intelligent_Pneumonia_Identification_From_Chest_X-Rays_A_Systematic_Literature_Review
25 pages
Comput Methods Programs Biomed 2021 206 106130
No ratings yet
Comput Methods Programs Biomed 2021 206 106130
11 pages
Image Classification Approach For Breast Cancer Detection Based On A Complex Event Processing
No ratings yet
Image Classification Approach For Breast Cancer Detection Based On A Complex Event Processing
8 pages
Comparing Deep Learning and Handcrafted Radiomics To Predict Chemoradiotherapy Response For Locally Advanced Cervical Cancer Using Pretreatment MRI
No ratings yet
Comparing Deep Learning and Handcrafted Radiomics To Predict Chemoradiotherapy Response For Locally Advanced Cervical Cancer Using Pretreatment MRI
11 pages
Lung Cancer Paper
No ratings yet
Lung Cancer Paper
7 pages
Breast Cancer Diagnosis in Two-View Mammography Using End-To-End Trained Efficientnet-Based Convolutional Network
No ratings yet
Breast Cancer Diagnosis in Two-View Mammography Using End-To-End Trained Efficientnet-Based Convolutional Network
15 pages
Research Article: CNN Based Multiclass Brain Tumor Detection Using Medical Imaging
No ratings yet
Research Article: CNN Based Multiclass Brain Tumor Detection Using Medical Imaging
8 pages
Thrall 2014
No ratings yet
Thrall 2014
5 pages
Deep Learning Radiomics-based Prediction of Distan
No ratings yet
Deep Learning Radiomics-based Prediction of Distan
10 pages
A Comparative Study of Tuberculosis Detection
No ratings yet
A Comparative Study of Tuberculosis Detection
5 pages
10
No ratings yet
10
12 pages
Detecting Pneumonia Using Convolutions and Dynamic Capsule Routing For Chest X-Ray Images
No ratings yet
Detecting Pneumonia Using Convolutions and Dynamic Capsule Routing For Chest X-Ray Images
30 pages
1 s2.0 S0169260717301451 Main - Read PDF
No ratings yet
1 s2.0 S0169260717301451 Main - Read PDF
12 pages
DL Frameworks - Brain Tumor
No ratings yet
DL Frameworks - Brain Tumor
27 pages
Classification of Breast Cancer Histopathological Images Using Discriminative Patches Screened by Generative Adversarial Networks
No ratings yet
Classification of Breast Cancer Histopathological Images Using Discriminative Patches Screened by Generative Adversarial Networks
16 pages
paper 26
No ratings yet
paper 26
11 pages
Deep Learning Predictive Model for Colon Cancer
No ratings yet
Deep Learning Predictive Model for Colon Cancer
10 pages
[email protected]
No ratings yet
[email protected]
11 pages
Cancers 11 01235 PDF
No ratings yet
Cancers 11 01235 PDF
36 pages
Reliable Tuberculosis Detection Using Chest X-Ray With Deep Learning, Segmentation and Visualization
No ratings yet
Reliable Tuberculosis Detection Using Chest X-Ray With Deep Learning, Segmentation and Visualization
15 pages
(IJETA-V11I3P9) :dr. Rahul Misra, Sudhanshu Vashistha
No ratings yet
(IJETA-V11I3P9) :dr. Rahul Misra, Sudhanshu Vashistha
5 pages
Textbook of Urgent Care Management: Chapter 35, Urgent Care Imaging and Interpretation
From Everand
Textbook of Urgent Care Management: Chapter 35, Urgent Care Imaging and Interpretation
Tim Hogan
No ratings yet
Applied Machine Learning and Multi-criteria Decision-making in Healthcare
From Everand
Applied Machine Learning and Multi-criteria Decision-making in Healthcare
Ilker Ozsahin
No ratings yet
Anarock_India_Data_Center_Report_1742961368
No ratings yet
Anarock_India_Data_Center_Report_1742961368
15 pages
Personality Prediction System Via CV Analysis: Curriculum Vitae
No ratings yet
Personality Prediction System Via CV Analysis: Curriculum Vitae
8 pages
Medical Image Captioning Via Generative Pretrained Transformers
No ratings yet
Medical Image Captioning Via Generative Pretrained Transformers
12 pages
A Review: Artificial Intelligence and Expert Systems For Cyber Security
No ratings yet
A Review: Artificial Intelligence and Expert Systems For Cyber Security
5 pages
Gartner CMO Guide
No ratings yet
Gartner CMO Guide
65 pages
A Multimodal Text Block Segmentation Framework For Photo Translation
No ratings yet
A Multimodal Text Block Segmentation Framework For Photo Translation
12 pages
7 Why Does The Future Not Need Us
No ratings yet
7 Why Does The Future Not Need Us
21 pages
Min Lin PDF
No ratings yet
Min Lin PDF
10 pages
IBM Registration Process Sample Format On The Steps To Be Followed by The Candidates 2022
No ratings yet
IBM Registration Process Sample Format On The Steps To Be Followed by The Candidates 2022
12 pages
Quantum Convolutional Neural Networks - PennyLane
No ratings yet
Quantum Convolutional Neural Networks - PennyLane
1 page
Machine Learning-1
No ratings yet
Machine Learning-1
64 pages
Reasoning With Sarcasm
No ratings yet
Reasoning With Sarcasm
11 pages
2023-2024 - UIL - 2 - Creativity and AI
No ratings yet
2023-2024 - UIL - 2 - Creativity and AI
43 pages
Sample Test 1
100% (1)
Sample Test 1
74 pages
STEMpedia AI Lab Brochure-1
No ratings yet
STEMpedia AI Lab Brochure-1
8 pages
Research Jollibot
100% (1)
Research Jollibot
21 pages
PV Systems AI
100% (1)
PV Systems AI
10 pages
EZRA EEMAN_Wayfinder Review Preview 23-24 Trends Report
No ratings yet
EZRA EEMAN_Wayfinder Review Preview 23-24 Trends Report
126 pages
Support Vector Machine-Updated Version
No ratings yet
Support Vector Machine-Updated Version
13 pages
Cloud Digital Leader Learning Path Quiz Solutions
No ratings yet
Cloud Digital Leader Learning Path Quiz Solutions
9 pages
Is Ai Killing JOBS???
No ratings yet
Is Ai Killing JOBS???
2 pages
Cognition Matlin 7th TB
100% (2)
Cognition Matlin 7th TB
22 pages
STGCN
No ratings yet
STGCN
4 pages
Reiter_2021_-_AIOps_-_A_Systematic_Literature_Review
No ratings yet
Reiter_2021_-_AIOps_-_A_Systematic_Literature_Review
11 pages
Varun Black Book 421080-1
No ratings yet
Varun Black Book 421080-1
34 pages
Unit 1
100% (1)
Unit 1
77 pages
1.a Survey of Elevator Group Control Systems For Vertical Transportation A Look at Recent Literature
No ratings yet
1.a Survey of Elevator Group Control Systems For Vertical Transportation A Look at Recent Literature
18 pages
Guava Fruit Disease Detction Using Machine Learning
No ratings yet
Guava Fruit Disease Detction Using Machine Learning
21 pages
Sai
No ratings yet
Sai
2 pages
Face Recognition Based Attendance System: Problem Statement
No ratings yet
Face Recognition Based Attendance System: Problem Statement
4 pages

Automated Chest Screening Based On A Hybrid Model of Transfer Learning and Convolutional Sparse Denoising Autoencoder

Uploaded by

Automated Chest Screening Based On A Hybrid Model of Transfer Learning and Convolutional Sparse Denoising Autoencoder

Uploaded by

Wang

et al. BioMed Eng OnLine (2018) 17:63

RESEARCH Open Access

Automated chest screening based

418 1318 179 565

In this work, we demonstrate how to automatically classify CXRs at different status

Next, we give detailed description of every datasets as follows.

Japanese Society of Radiological Technology (JSRT)

China set—The Shenzhen set—chest X‑ray database (SZCX)

Montgomery‑County chest X‑ray database (MC)

Lung field segmentation

Fig. 1 Workflow diagram of our proposed method

Fig. 2 Segmentation of lung from public database based on U-net

Fig. 3 Flow chart of Autoencoder

Table 2 Network architecture of CSDAE

Input_1(InputLayer) (None, 512, 512, 1) 0

Image classification with CNN

Table 3 Network architecture of proposed image classification

Input_4(InputLayer) (None, 512) 0

Combining predictors for classification based on ROC curve

Table 4 Comparisons of the mean and std of MSE between AE and CSDAE

AE 0.01 0.000983 ± 0.000304 0.001088 ± 0.0004 0.001112 ± 0.000405

KNN Without augmentation 0.73 0.58 0.64 0.753

Cross entropy Without augmentation 0.76 0.72 0.74 0.821

Table 7 Results of the performance of our network method

Table 8 The results of the performance of our method with max AUC​

Availability of data and materials

Consent for publication

Ethics approval and consent to participate

Received: 4 April 2018 Accepted: 9 May 2018

Ready to submit your research ? Choose BMC and benefit from:

• fast, convenient online submission

At BMC, research is always in progress.

Learn more biomedcentral.com/submissions

You might also like

Fig. 1 Workflow diagram of our proposed method

Fig. 2 Segmentation of lung from public database based on U-net

Fig. 3 Flow chart of Autoencoder

Table 2 Network architecture of CSDAE

Table 3 Network architecture of proposed image classification

Table 4 Comparisons of the mean and std of MSE between AE and CSDAE

Table 7 Results of the performance of our network method

Table 8 The results of the performance of our method with max AUC