Lung Cancer Detection Using Transfer Learning
Lung Cancer Detection Using Transfer Learning
Transfer Learning
Table of Contents
1. Introduction
2. Visualization of Dataset
3. Proposed Model
4. Convolutional neural network
5. Transfer Learning : VGG16-Net
6. Future work
7. Reference
INTRODUCTION
Lung cancer is one of the deadliest cancers worldwide. However, the early detection of lung cancer significantly improves
survival rate. Cancerous (malignant) and noncancerous (benign) pulmonary nodules are the small growths of cells inside the
lung. Detection of malignant lung nodules at an early stage is necessary for the crucial prognosis.
Early-stage cancerous lung nodules are very much similar to non-cancerous nodules and need a differential diagnosis on the
basis of slight morphological changes, locations, and clinical biomarkers. The challenging task is to measure the probability
of malignancy for the early cancerous lung nodules. Various diagnostic procedures are used by physicians, in connection, for
the early diagnosis of malignant lung nodules, such as clinical settings, computed tomography (CT) scan analysis
(morphological assessment), positron emission tomography (PET) (metabolic assessments), and needle prick biopsy analysis
For the input layer, lung nodule CT images are used and are collected for various steps of the project. The source of the
dataset is the LUNA16 dataset .
The LUNA16 dataset is a subset of LIDC-IDRI dataset, in which the heterogeneous scans are filtered by different criteria.
Since pulmonary nodules can be very small, a thin slice should be chosen. Therefore scans with a slice thickness greater than
2.5 mm were discarded.
VISUALIZATION OF DATASET
Visualization of dataset is an important part of training , it gives better understanding of dataset. But CT scan images are hard
to visualize for a normal pc or any window browser. Therefore we use the pydicom library to solve this problem. The
Pydicom library gives an image array and metadata information stored in CT images like patient’s name,patient’s id, patient’s
birth date,image position , image number , doctor’s name , doctor’s birth date etc.
(fig 3.Small sample of Metadata contain in a single dicom
slice)
PROPOSED MODELS
The proposed model is a convolutional neural network approach based on lung segmentation on CT scan images. At first we
preprocess the dataset of luna16. We tried three different models of Convolutional Neural Networks, which are based on the
comparative study of performance of each type model in different dataset and for different classification problems.
So, in order to increase the accuracy of the model we will try to do more efficient data-preprocessing techniques are to be
implemented now after and before the image segmentation process which will mainly focus on efficient division of data into
cancerous and non-cancerous classes and making the dataset compatible to be processed with computer vision library of
python otherwise implementing the algorithms on the dataset from self defined functions.
Also a new data processing, training and classification pipeline is to be proposed which will help the models to predict the
data more accurately.
Current Suggestions includes the use of some other transfer learning models from imagenet in keras including the one
proposed above and implementation of Feature Extraction Algorithms like BRISK and SIFT from Computer Vision Library
and also integrating the ML training methods.
REFERENCES
1. Bjerager M., Palshof T., Dahl R., Vedsted P., Olesen F. Delay in diagnosis of lung cancer in general practice. Br. J. Gen. Pract.
2006;56:863–868. [PMC free article] [PubMed] [Google Scholar]
2. Nair M., Sandhu S.S., Sharma A.K. Cancer molecular markers: A guide to cancer detection and management. Semin. Cancer Biol.
2018;52:39–55. doi: 10.1016/j.semcancer.2018.02.002. [PubMed] [Google Scholar]
3. Silvestri G.A., Tanner N.T., Kearney P., Vachani A., Massion P.P., Porter A., Springmeyer S.C., Fang K.C., Midthun D., Mazzone P.J.
Assessment of plasma proteomics biomarker’s ability to distinguish benign from malignant lung nodules: Results of the PANOPTIC
(Pulmonary Nodule Plasma Proteomic Classifier) trial. Chest. 2018;154:491–500. doi: 10.1016/j.chest.2018.02.012. [PMC free article]
[PubMed] [Google Scholar]
4. Shi Z., Zhao J., Han X., Pei B., Ji G., Qiang Y. A new method of detecting pulmonary nodules with PET/CT based on an improved
watershed algorithm. PLoS ONE. 2015;10:e0123694. [PMC free article] [PubMed] [Google Scholar]
5. Lee K.S., Mayo J.R., Mehta A.C., Powell C.A., Rubin G.D., Prokop C.M.S., Travis W.D. Incidental Pulmonary Nodules Detected on
CT Images: Fleischner 2017. Radiology. 2017;284:228–243. [PubMed] [Google Scholar]
ABOUT TechieYan Technologies
TechieYan Technologies offers a special platform where you can study all the most cutting-edge technologies directly from
industry professionals and get certifications. TechieYan collaborates closely with engineering schools, engineering students,
academic institutions, the Indian Army, and businesses.
Website: https://techieyantechnologies.com/
THANK YOU