Automatic Detection of Pneumonia On Compressed Sensing Images Using Deep Learning
Automatic Detection of Pneumonia On Compressed Sensing Images Using Deep Learning
Abstract—Pneumonia is one of the life threatening very framework that enables detection of pneumonia more accu-
common disease and needs proper diagnosis at an early stage rately.
for proper treatment of recovery. Chest X-ray is used as an Deep learning (DL) is a technique of extracting patterns
imagining modality to identify the disease by a professional
radiologist. This paper suggests a Compressed Sensing (CS) based from data through an artificial neural network (ANN) with
deep learning framework for automatic detection of pneumonia a large number of interconnected units in order to perform
on X-ray images to assist the medical practitioners. Extensive real world complex tasks especially related to images [4].
simulation results show that the proposed approach enables Sometimes, the deep learning based systems perform better
detection of pneumonia with 97.34% prediction accuracy and than human in term of accuracy in pattern recognition. The
an improvement on reconstruction quality of the X-ray images
in terms of PSNR by 1±0.76 dB and SSIM by 0.2±0.05 using the deep learning is chosen as a very natural application in the
proposed method compared to the other state-of-the-art methods. field of medical radiology as the diagnosis process as it relies
on extracting useful information from images. The use of
Index Terms—Compressed sensing, Pneumonia detection, Deep deep learning in CS paradigm ensures convergence bounds
Learning, CNN for the recovery of a signal in an underdetermined system
[5]. An underdetermined system, instead of following the
I. I NTRODUCTION Nyquist sampling theorem, enables signal acquisition process
Pneumonia is a very common infective disease caused by by reducing the required sample size with significantly higher
bacteria, viruses, or fungi [1]. In severe case, it leads to order [5]–[7]. Hence, the combining effect of DL with the
death specially in children younger than 5 years of age, older CS leads to faster signal acquisition along with a guaranteed
adults and people with other diseases or impaired immune improvement in diagnosis accuracy.
systems. In every year, around 1 million people are detected The rest of the paper is organized as follows: a brief review
with pneumonia and around 50, 000 die from the disease on state-of-the-art methods and scope of the present work
in the United States (U.S.) like country [2]. To diagnose are reported in Section II. The proposed system model using
pneumonia, chest X-ray is used by the medical practitioners as DL framework for CS image reconstruction and pneumonia
the best imaging modality. However, proper analysis of the X- detection is described in Section III. Section IV presents
ray images need a radiologist with rich domain expertise and the simulation results and performance evaluation. Finally,
experience. According to an estimate by the World Health conclusions and scope of the future works are stated in Section
Organization (WHO), two thirds of the people in the world V.
don’t have access to a radiologist for diagnosis of their disease
yet. It would be very helpful if an automated computerized II. L ITERATURE R EVIEW AND S COPE OF P RESENT W ORK
system is developed to analyze the X-ray images for assisting This section presents a brief review on the related works and
the medical advisor to diagnose appropriately. To offer proper describes the scope and contributions of the present work.
medical services in remote villages, often imaging to be done
at one end and the medical advisors are present at far end A. Literature Review
which needs transmission of images over wireless channels. Over the last decade, several machine learning based auto-
To avail and adapt the limited bandwidth of wireless channel, mated methods for identifying different types of pneumonia
far end image reconstruction (to be used for diagnosis) to be have been widely studied [8]–[11]. Fiszman et al [8] used a
accomplished from sub-sample measurements [3]. This faster natural language processing (NLP) tool to identify acute bac-
sensing i.e. sub-sample sensing is also used for reduction in terial pneumonia-related disease in chest X-ray. Performance
radiation from X-ray imaging. Recently, Compressed Sensing of this type of resource intensive application is very much
(CS) is used in various medical imaging and a deep learning comparable to that of the human expert. Chapman et al [9]
Convolutional Network
demonstrated three computerized methods using a rule base, a
probabilistic Bayesian network, and a decision tree to diagnose
the chest X-ray report associated with acute bacterial pneumo-
nia. In [10], a study of feasibility of an NLP-based monitoring
system is done to identify healthcare-associated pneumonia 32 × 32 × 128
64 × 64 × 64 x̂
in neonates. However, practical clinical applications of these
types of methods are limited due to the dependency on the Label
4096
16 × 16 × 128
32 × 32 × 256
as fuzzy c-means allocate weights to all the pixels of the input Convolutional Network FullyConnected Network
X-ray images. Rajpurkar et al [11] demonstrated ChexNet, Fig. 1: Proposed CS based DL model for pneumonia detection and reconstruction of the
a 121-layer deep convolutional neural network (CNN), that X-ray images
provides the probability of detecting or identifying pneumonia
using a heatmap to localize the area of the infection. Kermany A. CS Image Reconstruction Model
et al. [13] introduced a transfer learning based DL framework The image reconstruction is considered as an inverse prob-
to diagnose pediatric pneumonia using chest X-ray images. lem which seeks an estimation x̂ of the image by minimizing
However, none of the methods are exploited to classify X-ray the following objective function
images with pneumonia for the CS framework to meet the
need of remote end analysis. x̂ = arg min k Φx − y k22 +λϕ(x) (2)
x∈S
B. Scope and Contributions of Present Work
where S is a convex set, λ > 0 trade offs the weights
An accurate reconstruction followed by classification algo-
between the terms in the objective function and ϕ is a
rithm plays a very important role in developing an application
regularization parameter.
specific medical imaging system to diagnose a disease reliably.
The objective function in Equation 2 can be equivalently
An image classification system operated under CS framework
learned by a CNN through updating weights W using proximal
needs to extract the most useful features from the observations
(or projected) gradient descent [5].
for accurate classification and also to address an ill-posed
inverse problem to reconstruct the X-ray image. Therefore,
it is very much essential to develop a robust method which x̂n = Q R x̂n−1 ; Φ, y, % (3)
can be easily implemented in a low cost X-ray imaging
where, Q is defined as a quasi-projection operator onto the
system to detect pneumonia more accurately. To this aim, this
set of artifact-free images, n is the iteration number and α is
paper suggests a DL based CS framework with the following
the step size. The C x̂(n) denotes the cost function which
contributions:
• A multi-layer convolutional neural network (CNN) is
is defined as a reconstruction operator R x; Φ, y, % to learn
used to extract features from CS measurements for clas- Equation 2 by the CNN. The % is a set of parameters for the
sifying the X-ray images of pneumonia patient. reconstruction method.
• A sub-channel is used to provide the reconstructed X-ray
image which helps to verify manually whether pneumonia B. The classification Model
is present or not. A simple multi-layer network, with 4 convolutional and
• Extensive simulation results show that the proposed ap- 3 fully connected layers as hidden layers, is used to detect
proach can classify the pneumonia infected X-ray images pneumonia for the CS framework as shown in Figure 1. To
with 97.34% prediction accuracy. introduce non-linearity in the network, all the hidden layers
use leaky Rectified linear unit (ReLU) activation function that
III. P ROPOSED S YSTEM M ODEL
has a non-zero gradient over its entire domain. The leaky ReLu
This section presents the proposed system model using (LReLU) activation function is defined as
DL framework for CS image reconstruction and pneumonia (
detection. In CS, a sparse signal x ∈ <N is recovered y if y > 0
from a measurement vector y ∈ <M (where M << N ). flR (y) =
0.001y otherwise
Mathematically, the measurement vector can be represented
using the following equation As shown in Figure 1, after reshaping the input measure-
ment vector y, 2 − d convolutional followed by up-sampling
y = Φx + η (1)
operations are performed to reconstruct the X-ray image. The
where Φ ∈ <M ×N is a linear operator called as sensing matrix last three layers are added as fully connected network for
and η ∈ <N is the noise on the sensing signal. classifying the input with an appropriate label. The output
2019 IEEE Canadian Conference of Electrical and Computer Engineering (CCECE)
layer uses a sigmoid function with 2 units (one for pneumonia low throughout the network and also adds regularization effect
and another one for normal) as defined follows: to reduce overfitting of the network on the training data [15].
1 TABLE I: Hyper Parameters used in the Training
fS (y) =
1 + e−ΘT y
Parameter Name Value
where, Θ denotes parameters of the model, defined as the
weight matrices W and bias vectors b. Initial Learning Rate, α 0.0018
Momentum:γ, τ 0,0
Now, the output of the final layer can be expressed mathe-
Dropout Rate 0.4
matically as
Optimizer used Gradient Descent
Layer activations Leaky ReLu
f (y; Θ) = fS W hF flR W hF −1 y + bhF −1 + bhF
(4)
Figure 2 shows the plot of the variation of the classification
The output of the final layer can be defined as categorical
accuracy evaluated for both the training and the validation
probability distribution:
data-sets against each epoch. During the learning process,
f (y)c = p̂(Lo = c|y) the training and the validation accuracy reach 98.82% and
99.80%, respectively after completion of 400 epochs. Variation
where Lo denotes the output label and c ∈ {0, 1} is one of of the MSE loss in every epoch is also studied as plotted
the 2 output classes. in Figure 3 for both the training and the validation data-
The parameters of the network is trained by minimizing the sets. The training and the validation loss go on reducing with
cross-entropy (negative log-likelihood) shown in Equation 5 the development of the learning process of the network as
between the models output and the target distribution using expected.
stochastic gradient descent (SGD) on mini-batches.
1
X
L f (y); Lo = − log f (y)c = − log f (y)Lo (5)
c=0