Lung Cancer Predictionusing Deep Learning Framework
Lung Cancer Predictionusing Deep Learning Framework
net/publication/346700750
CITATIONS READS
26 2,264
6 authors, including:
Srikar Amara
SEE PROFILE
All content following this page was uploaded by R. Raja Subramanian on 08 December 2020.
Abstract
Lung carcinoma also known as lung cancer is one of the dangerous diseases caused all over the
world. It is caused due to the reluctant increase of cells in the lung tissues. It is cured only
during its early stage, by starting the treatment. This is detected using the Computed
Tomography (CT) scanning and blood test reports. By blood test, the tumour is detected after the
humans affected with a minimum span of 4 years. So, to know the early stage of cancer, CT
scanning is used. The CT images are classified into normal and abnormal. The abnormal image
is detected by focusing on the tumour portion. The dataset in jpg format, composed of Computed
Tomography (CT) images. The proposed model is trained by using the Convolutional Neural
Network (CNN). Pretrained ImageNet models including LeNet, AlexNet and VGG-16, are used to
detect lung cancer. The proposed model uses AlexNet model and the features obtained from the
last fully connected layer of the model were separately applied as input to the softmax classifier.
The combination of AlexNet and softmax layer together has given an accuracy of 99.52%. The
proposed model serves as a consistent and sustainable diagnosis model for lung cancer
detection.
Keywords: Deep Learning, Lung Cancer Prediction, AlexNet, Softmax layer, CT images
1. Introduction
Cancer is a disease which is caused by abnormal growth of cells/tissues in the human body.
Since many years’ cancer is one of the deadly diseases that threatens human health. As of the
figures it is estimated that there are 1.8 million new cancer cases diagnosed and more than
606,520 deaths in the United States as of 2020 and estimated new cases of whopping 247,270
lung cancer cases. The examination led in 2018, results that 18.1 million disease cases will be
added to the accessible malignancy cases on the planet and around 9.6 million will bring about
death. Lung Carcinoma growth is the most well-known kind of disease on the planet with a pace
of 13%. In clinical field, the assessment and determination of the lung CT picture is a delicate
procedure and requires time and high capability.
The abstract assessment prompts the emerging changeability among the onlookers. Thus, the
PC based frameworks are sought after. The diagnosing procedure should be possible by utilizing
existing open learning innovations. Along these lines, the cost will be radically decreased. For
that reason, one of the basic models favored for lung malignancy (carcinoma) determination is
profound learning picture nets. We have used convolutional neural network models such as
AlexNet, LeNet and VGG-16. AlexNet model is used for analysing and classifying lung CT
images. It consists of eight layers. In those layers, 5 of them are convolutional and max-pooling.
Three of which are fully connected layers. We’ve used a hybrid model where the CNN AlexNet
model is connected with the softmax layer. In our study, the CT image dataset is classified as
Cancer and Non- cancer dataset. We used the ‘Adam’ optimizer technique
mean histogram balance approach. To improve the nature of the picture, the IPCT is utilized [3].
A probabilistic model is utilized to foresee lung malignant growth by utilizing CNN based
technique utilized for profound component extraction and a low-portion CT check dataset is
joined with full-mode iterative recombination (IMR) [4].
Experiments are conducted Clamors are killed by applying a weighted mean histogram
balance approach. To improve the nature of the picture, the IPCT is utilized [3]. A probabilistic
model is utilized to foresee lung malignant growth by utilizing CNN based technique utilized for
profound component extraction and a low-portion CT check dataset is joined with full-mode
iterative recombination (IMR) [6]. In this a new model is created by using the SVM and CNN
together called SVM-CNN hybrid model for the pattern recognition. In this the last output layer
of CNN is replaced with SVM [6,7]. Deep CNN model is used to develop a classification scheme
over microscopic lung cancer images and for the classification purpose, a softmax layer is used
alongside with CNN model. It consists of layers such as two fully connected layers, three
convolutional and three pooling layers. To beat the overfitting in the preparation procedure, each
picture is gone through a lot of methodology, for example, reflection, revolution, channel and so
on. It made a 71% progress rate in the grouping procedure [7]. In another investigation, the
impact of including the lymph hub size as a contribution to a backpropagation ANN was
analyzed. This was done on the 133 patients with non - little cell lung malignant growth. It
guaranteed a precision of 99.2% [9]. So as to foresee the malignant growth in its beginning
period non-little cell lung disease (NSCLC), a picture classifier called computational
histomorphometry using surface, shape, atomic direction from digitized H&E tissue microarray
(TMA) was introduced. This model gives a precision of 81% [10].
By using the computer-aided diagnostic system (CADS), a lung cancer detection system is
developed. The images in the edges are extracted into three stages. In primary stage, the vertical
direction of the image at the edge map was extracted. In secondary stage, a horizontal direction
of the image at the edge map was mined. In final stage, both vertical and horizontal directions of
the edge map of all transitions were subtracted and combined [12].
3.1. Convolution
Convolution thinks about the pictures piece by piece. The pieces that it searches for, known as
features [16]. The actual trained image is compared with the input by looking at the position of
the image with the trained image. CNN improves at seeing correspondingly than entire picture
coordinating plans [17].
155
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC
International Journal of Control and Automation
Vol. 13, No. 3, (2020), pp. 154-160
156
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC
International Journal of Control and Automation
Vol. 13, No. 3, (2020), pp. 154-160
is zero then it is the perfect prediction. If the loss is higher, then it is the worst model [13, 14]
The loss percentage of the model is depicted in figure 2. It is evident that the model exhibits
lower loss percentage in subsequent epochs. The accuracy of training the model is presented in
figure 3. The model exhibits an accuracy rate of 99.52%. Confusion matrix is also known as
error matrix. It is a table, where true values are known and is used to define the operation of a
classifier on a test dataset. In a classification problem it is the summary of predicted results. The
main aim of the confusion matrix is to summarize the correct and incorrect predictions with the
count values by each class. It gives us the types of errors that came from the classifier. It is the
classifier with actual class and predicted class which contains the positive and negative. Positive
represents the observation (actual value) is true. Negative represents theobservation (actual
value) is false. Based on the positive and negative four terms are
Figure 2. Loss Percentage Graph
157
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC
International Journal of Control and Automation
Vol. 13, No. 3, (2020), pp. 154-160
derived for our easy understand. True Positive (TP), True Negative, False Positive (FP) and False
Negative (FN) [15, 16, 17]. In TP observation and prediction is true. In TN observation and
prediction is false. In FP observation is false but prediction is true. In FN observation is true but
prediction is false. The confusion matrix is graphically depicted in figure 4.
Figure 3. Accuracy of the Proposed Model
7. Conclusion
A deep learning model leveraging AlexNet pretrained model combined with a softmax layer is
developed and used to efficiently classify the lung CT images for cancer. The proposed model is
compared with the existing state-of-the-art models. Empirical evaluations of the proposed model
depict that the model outperforms the similar models available in the literature, with the accuracy
level of 99.52%. In addition, a user interface leveraging python-tkinter is developed for effective
usage by common people. This health care project deems to be a sustainable application for
classifying lung CT images for cancer prediction. The results from the application are in par with
the clinical results.
In the future, we prefer to identify the influential factors for lung cancer in the human body,
and use the same as features of cancer prediction. Physiological parameters for prediction include
pressure, oxygen level, body temperature. An IoMT application can be developed to extract these
physiological parameters. Effective classification and analysis can be done leveraging fog-
assisted cloud computing.
158
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC
International Journal of Control and Automation
Vol. 13, No. 3, (2020), pp. 154-160
References
[1] M. Togacara, B. Ergen, Z. Cömert, “Detection of lung cancer on chest CT images using
minimum redundancy maximum relevance feature selection method with convolutional
neural networks”, Elsevier Journal of Biocybernetics and Biomedical engineering vol. 40,
no. 1, (2020) pp. 23-34.
[2] Z. Liu, C. Yao, H. Yu, T. Wu, “Deep reinforcement learning with its application for lung
cancer detection in medical Internet of Things”, Elsevier Journal of Future Generation
Computer Systems, vol. 97, (2019) pp. 1- 9.
[3] P. M. Shakeel, M. A. Burhanuddin, M. I. Desa, “Lung cancer detection from CT image
using improved profuse clustering and deep learning instantaneously trained neural
networks”, Elsevier Journal of Measurement, vol. 145, (2019) 702-712.
[4] Y. Liu, H. Wang, Y. Gua, X. Lv, “Image classification toward lung cancer recognition by
learning deep quality model”, Elsevier Journal of Visual Communication and Visual
Representation, vol. 63, (2019) pp. 274-282.
[5] P. Huang, C. T. Lin, Y. Li, M. Tammemagi, “Prediction of lung cancer risk at follow-up
screening with low-dose CT: a training and validation study of a deepLearning method”,
The Lancet Digital Health, vol. 1, (7), (2019).
[6] S. K., L. Mohanty, S. N., K., S., N., A., & Ramirez, G., “Optimal deep learning model for
classification of lung cancer on CT images”, Future Generation Computer Systems, vol. 92,
(2018) pp. 374-38.
[7] S. Alagarsamy, K. Kamatchi, V. Govindaraj, Y. D. Zhang, A. Thiyagarajan, “Multi-
channeled MR brain image segmentation: A new automated approach combining BAT and
clustering technique for better identification of heterogeneous tumors”, Biocybernetics and
Biomedical Engineering, vol.39, no.4 (2019), pp.1005-1035.
[8] S. Alagarsamy, K. Kamatchi, V. Govindaraj, A. Thiyagarajan “A fully automated hybrid
methodology using Cuckoo-based fuzzy clustering technique for magnetic resonance brain
image segmentation”, International journal of Imaging systems and technology, vol. 27,
no.4, (2017), pp. 317-332.
[9] S. Alagarsamy, K. Kamatchi, V. Govindaraj, “A Novel Technique for identification of
tumor region in MR Brain Image,” 3rd International conference on Electronics,
Communication and Aerospace Technology (ICECA), (2019), pp. 1061-1066.
[10] V. Govindaraj, V, Sofia Fazila, M. Beevi, M. Marikani, “An Imitating Device for Assisting
Trans-Radial Amputees using Gestures,” International Journal of Advanced Science and
Technology, vol. 29, no.7s, (2020), pp. 2922-2931.
[11] V. Govindaraj, V. Sivasankar, T. Raja, S. Alagarsamy, A. Thiyagarajan, “Light Guidance
for Easy Coagulation, Cutting and Desiccation in Surgical Diathermy,” Journal of Advanced
Science and Technology, vol. 29, no.7s, (2020), pp. 2941-2947.
[12] V. Govindaraj, K. Venkatesan, K. Seethapathy, S. Suthanthiram, S. Alagarsamy, A.
Thiyagarajan, “An Imitating Wearable Kidney: A Dialectical Replacement For The
Cumbersome Dialysis Procedure For Renal Failures,” Journal of Advanced Science and
Technology, vol. 29, no.7s, (2020), pp. 2932-2940.
[13] A. Thiyagarajan, P. Sankar, A.R. Chandragurunathan, J.M. Jaffer, V. Govindaraj,
S. Alagarsamy, “Detection of Alzheimer’s Disease Using Soft Computing Techniques,”
Journal of Advanced Science and Technology, vol. 29, no.8s, (2020), pp. 2914-2921.
[14] S. Alagarsamy, V. Govindaraj, K. Meenakumari, K. Priyadhara, “Identification of various
diseases in guava fruit using spiral optimization (SPO) technique,” Test Engineering and
Management, vol.83, (2020), pp.9561-9566.
[15] R.R. Subramanian, K. Seshadri, “Design and Evaluation of a Hybrid Hierarchical Feature
Tree Based Authorship Inference Technique”, In: Kolhe M., Trivedi M., Tiwari S., Singh V.
(eds) Advances in Data and Information Sciences. Lecture Notes in Networks and Systems,
vol 39. Springer, Singapore (2019).
[16] QZ. Song, L. Zhao, XK. Luo, XC. Dou, “Using deep learning for classification of lung
nodules on computed tomography images”, Journal of Health Engineering, vol. 72 (7),
(2017).
159
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC
International Journal of Control and Automation
Vol. 13, No. 3, (2020), pp. 154-160
[17] LK. Toney, HJ. Vessellw, “Neural networks for nodal staging of non-small cell lung Cancer
with FDG PET and CT: importance of combining uptake values and sizes of nodes and
primary tumor”, Radiology, vol.270, no.1 (2014), pp.91- 98.
[18] R.R Subramanian, R. Ramar, “Design of Offline and Online Writer Inference Technique”,
International Journal of Innovative Technology and Exploring Engineering, vol. 9, (2019),
pp. 1-4.
[19] X. Wang, A. Janowczyk, Y. Ahou, R. Thawani, P. Fu, K. Schalper, “Prediction of
recurrence in early stage non-small cell lung cancer using computer extracted nuclear
features from digital H&E images”, Sci Rep (2017) 7:13543
[20] Potghan S., “Multi-layer perceptron-based lung tumor classification”, Second int conf
Electron common aerospace technology, (2018) pp. 499- 502.
[21] R.R. Sudharsan, J. Deny, E. Kumaran, A.S. Geege, “An Analysis of Different Biopotential
Electrodes Used for Electromyography”, Journal of Nano- and Electronic Physics (2020) pp.
1-7.
[22] R.R. Krishna, P.S. Kumar, R.R. Sudharsan, “Optimization of wire-length and block
rearrangements for a modern IC placement using evolutionary techniques”, IEEE
International Conference on Intelligent Techniques in Control, Optimization and Signal
Processing, (2017) pp. 1-4.
[23] J. Deny, R.R Sudharsan, “Block Rearrangements and TSVs for a Standard Cell 3D IC
Placement”, In Intelligent Computing and Innovation on Data Science, Springer, (2020) pp.
207-214.
[24] R.R. Sudharsan, J. Deny, “Field Programmable Gate Array (FPGA)-Based Fast and Low-
Pass Finite Impulse Response (FIR) Filter”, In Intelligent Computing and Innovation on
Data Science, Springer, (2020) pp. 199-206.
[25] C. D. Scott and R. E. Smalley, “Diagnostic Ultrasound: Principles and Instruments”, Journal
of Nanosci. Nanotechnology., vol. 3, no. 2, (2003), pp. 75-80.
160
ISSN: 2005-4297 IJCA
Copyright ⓒ 2020 SERSC