Final Book
Final Book
By
2024
Artificial intelligence project applied in
Presented by:
2024
Eng: Abdelmawala Yousef
Dedication
To my supportive parents,
who always believe in my
abilities and encourage me
to face challenges. Thanks
to you, I was able to achieve
this accomplishment. With
all my love and gratitude.
2024
TABLE OF CONTENTS III
LIST OF FIGURES IV
NOMENCLATURE V
ACKNOWLEDGEMENT VI
ABSTRACT VI
CHAPTER 1: INTRODUCTION 1
1.1 OVERVIEW 2
1.2 PROBLEM DEFINITION AND OBJECTIVES 4
1.3 PROPOSED SOLUTION 5
1.4 MOTIVATION 5
1.5 PROJECT PLAN 7
CHAPTER 2: LUNG CANCER 8
2.1 OVERVIEW 10
2.2 TYPES OF LUNG CANCER 11
2.3 SYMPTOMS 12
2.4 PREVENTION 13
2.5 DIAGNOSIS 13
2.6 TREATMENT AND CARE 14
2.7 STAGES OF CARE 14
2.8 CLINICAL TRIALS 15
2.9 LUNG CANCER SCANS 16
CHAPTER 3: LITERATURE REVIEW 18
6.1 UI/UX 61
6.2 WEB SITE 64
6.3 APPLICATION 66
CHAPTER 7: USED SOFTWARE 71
7.1 INTRODUCTION 72
7.2 TOOLS USED 72
CHAPTER 8: CONCLUSION 77
8.1 CONCLUSION 78
8.2 FUTURE WORK 80
CHAPTER 9: REFERENCE 81
Table of Contents
List of Figures
We would like to express our sincere gratitude to everyone who contributed to the
development of the lung cancer detection and type identification project.
Firstly, we would like to express our appreciation to our advisors and mentors who
provided invaluable guidance and support throughout the project. Your ideas and
comments have helped us improve our ideas and make critical decisions. We would
also like to thank our supervisor, Dr. Fatima Muhammad Talaat, for her invaluable
guidance and support throughout the research and experimentation process.
Thank you all for your contributions to this project and your continued support of
our work.
Abstract
Lung cancer remains the leading cause of death worldwide. Early and accurate
detection is critical to improving patient outcomes. Using complex algorithms and
software, artificial intelligence (AI) is able to mimic human cognition in analyzing,
interpreting, and making sense of complex data, and is currently being successfully
applied in various healthcare settings. By leveraging AI's ability to measure
information from images, and its superior ability to recognize complex patterns in
images compared to humans, AI has the potential to help clinicians interpret LDCT
images obtained in the lung cancer screening setting. This book explores the potential
of deep learning to revolutionize lung cancer diagnosis. Intelligent Imaging: A Deep
Learning Approach to Lung Cancer Detection and Patient Support details the
development of a deep learning model to identify lung cancer in chest scans. The
book delves into the technical aspects of the model while emphasizing its real-world
applicability.
CHAPTER 1: INTRODUCTION
Chapter 1:INTRODUCTION
1.1 OVERVIEW
Lung cancer is a malignant tumor that arises from lung cells, especially within the
epithelial lining of the bronchi, bronchioles, or alveoli. It is widespread and associated
with high mortality rates on a global scale. In its early stages, Lung cancer shows no
symptoms or presents mild manifestations. As a result, it is usually diagnosed at an
advanced stage. Delay in identifying a medical condition affects the effectiveness of
treatment and reduces the likelihood of long-term survival. The two common types of
Lung cancer are non-small cell lung cancer (NSCLUNG CANCER) and small cell
lung cancer (SCLUNG CANCER). NSCLUNG CANCER is usually classified into
two subtypes: lung squamous cell carcinoma (LUSC) and lung adenocarcinoma
(LUAD). The precise classification of Lung cancer, including LUSC, LUAD, and
SCLUNG CANCER, has an important role in determining the prognosis of Lung
cancer compared with benign and malignant classifications. Accurate classification of
LUNG CANCER at the initial diagnosis stage significantly improves the treatment
efficacy and thus increases the survival rate of patients. PET and computed
tomography (CT) are widely used as non-invasive diagnostic imaging modalities in
clinical practice, where they serve as valuable tools to evaluate the specific diagnosis
of Lung cancer.
Early detection and treatment of Lung cancer through effective screening methods is
vital in enhancing patient outcomes. Based on results from the National Lung
Screening Trial, low-dose helical CT screening is more effective in reducing mortality
in high-risk populations. However, the Lung cancer screening process is prone to
giving false positive (FP) results, which leads to increased costs due to unwarranted
medical interventions and may lead to psychological distress in individuals.
Computer-aided diagnosis has notable advantages in Lung cancer detection, including
enhanced scope in early cancer screening and reduced incidence of FP findings
throughout the diagnostic process.
In the field of Lung cancer detection, there have been notable developments in the
form of new methods and technologies to enhance early diagnosis and treatment
effectiveness. Liquid biopsies are used to test blood samples for cancer. These
diagnostic tests can identify genetic abnormalities and alterations associated with
2
Chapter 1:INTRODUCTION
Lung cancer. Thus, these tests provide a non-invasive way to diagnose the disease and
monitor the effectiveness of treatment. Low dose computed tomography (LDCT)
scanning has emerged as a widely applied method for timely identification of Lung
cancer. LDCT scans use lower radiation levels than traditional CT scans while
providing high-resolution images of the lung area. New bronchoscopy methodologies,
including electromagnetic navigation bronchoscopy and robot-assisted bronchoscopy,
facilitate minimally invasive lung lesion biopsies. These tools promote rapid and
accurate identification of medical conditions. The integration of genomics, proteomics
and metabolomics has allowed the development of diverse strategies for Lung cancer
identification. These methodologies use several molecular markers to enhance
diagnostic accuracy and discover potential targets for therapeutic intervention.
Due to privacy concerns and the cost of data acquisition, medical image datasets are
typically small. Pre-trained models can apply broader image dataset expertise to
medical images, enabling models to be trained with minimal medical data. These
3
Chapter 1:INTRODUCTION
models can extract hierarchical information from images, including fine details and
important patterns. Feature extraction supports medical image classification models.
Despite the fact that significant advances in the diagnosis and treatment of lung
cancer have been made, the disease is still associated with poor clinical outcomes and
survival is strongly determined by the stage of disease at diagnosis and thus, whereas
the five-year survival rate for patients with the early-stage disease is 56%, in those
with advanced disease the 5-year survival rate is less than 5%. Considering that only
16% of lung cancers are diagnosed in the early stage and that most patients present
with advanced disease, developing screening tests capable of detecting the disease in
the initial stages has been a long-term goal in lung cancer care.
Several screening methods have been tested so far, including sputum cytology, chest
radiographs (CXR), and low-dose computer tomography (LDCT), and recently the
analysis of various biomarkers, however, data from clinical trials indicate that only
the use of low-dose computer tomography scans (LDCT) in heavy smoker individuals
has been associated with a significant reduction in lung-cancer-related mortality.
Despite the fact that screening lung cancer with LDCT has demonstrated a clear
benefit for reducing all-cause mortality, the high rate of false-positives and the cost of
unnecessary diagnostic procedures needed to confirm or rule out those false-positives
are important limitations of this approach.
4
Chapter 1:INTRODUCTION
The emergence of artificial intelligence as a new tool for evaluating medical data
means new opportunities to improve the diagnosis and treatment of various human
diseases. In the case of lung cancer diagnosis, coupling AI algorithms with available
clinical and biomedical data has the potential to improve lung cancer screening
methods. For example, AI has the potential to improve the analysis and interpretation
of lung images obtained via magnetic resonance imaging (MRI) or computer
tomography (CT) and can be useful in deciphering the clinical significance of data
derived from tissue or fluid biomarkers. Better. Accordingly, we decided to design an
artificial intelligence model that detects cancer and accurately determines its type
from CT scans.
1.4 MOTIVATION
Lung cancer remains the leading cause of cancer-related deaths globally, claiming
more lives than breast, prostate, and pancreatic cancer combined.
Early detection is crucial for improving survival rates and potentially achieving
complete cure.
Deep learning algorithms excel at analyzing complex medical images like CT scans
and PET scans.
They can identify subtle patterns and features indicative of early-stage lung cancer,
which might be missed by human analysis alone.
Deep learning models can be trained on vast datasets of medical images, leading to
highly accurate predictions and classifications.
This can assist radiologists in their diagnoses, potentially reducing human error and
improving overall detection efficiency.
5
Chapter 1:INTRODUCTION
Deep learning algorithms can analyze patient data beyond just images, including
genetic information and medical history.
This can aid in personalized treatment plans, risk assessments, and targeted therapies
tailored to individual patients.
As deep learning technology advances, it has the potential to be integrated into real-
time diagnostic tools.
This could enable faster diagnoses during medical procedures, leading to quicker
treatment initiation and improved patient outcomes.
Widespread adoption of accurate and efficient lung cancer detection methods powered
by deep learning can significantly impact public health.
Early detection translates to better treatment outcomes, reduced mortality rates, and
potentially a decrease in the overall burden of lung cancer on healthcare systems.
6
Chapter 1:INTRODUCTION
Firstly, we put a plan to improve our project, so we put the important points and start
searching to reach our goal. Here are the important points we think about:
3. UI/UX: (Chapter 5)
Now we need a design that fits the situation, so we decided to choose designs
like hospitals and clinics designs to let the user feel familiar with our
application.
5. Merging: (Chapter 5)
Now we need to merge the application with the model.
We found that Fast API is the best tool to handle this job so we will use Fast
API to link our model with the application and web site.
7
CHAPTER 2: LUNG CANCER
8
Chapter 2 :LUNG CANCER
9
Chapter 2 :LUNG CANCER
2.1 OVERVIEW
Lung cancer is a type of cancer that starts when abnormal cells grow in an
uncontrolled way in the lungs. It is a serious health issue that can cause severe harm
and death.
Symptoms of lung cancer include a cough that does not go away, chest pain and
shortness of breath.
It is important to seek medical care early to avoid serious health effects. Treatments
depend on the person’s medical history and the stage of the disease.
The most common types of lung cancer are non-small cell carcinoma (NSCLUNG
CANCER) and small cell carcinoma (SCLUNG CANCER). NSCLUNG CANCER is
more common and grows slowly, while SCLUNG CANCER is less common but
often grows quickly.
Smoking tobacco (including cigarettes, cigars, and pipes) is the primary risk factor for
lung cancer, but it can also affect non-smokers. Other risk factors include exposure to
secondhand smoke, occupational hazards (such as asbestos, radon and certain
chemicals), air pollution, hereditary cancer syndromes, and previous chronic lung
diseases.
10
Chapter 2 :LUNG CANCER
The main types of lung cancer are non-small cell lung cancer and (NSCLUNG
CANCER) and small cell lung cancer (SCLUNG CANCER).
About 80% to 85% of lung cancers are NSCLUNG CANCER. The main subtypes of
NSCLUNG CANCER are adenocarcinoma, squamous cell carcinoma, and large cell
carcinoma. These subtypes, which start from different types of lung cells, are grouped
together as NSCLUNG CANCER because their treatment and prognoses (outlooks)
are often similar.
Lung adenocarcinoma occurs mainly in people who smoke or used to smoke, but it is
also the most common type of lung cancer seen in people who don’t smoke. It is more
common in women than in men, and it is more likely to occur in younger people than
other types of lung cancer.
11
Chapter 2 :LUNG CANCER
This type of lung cancer tends to grow and spread faster than NSCLUNG CANCER.
In most people with SCLUNG CANCER, the cancer has already spread beyond the
lungs at the time it is diagnosed. Since this cancer grows quickly, it tends to respond
well to chemotherapy and radiation therapy. Unfortunately, for most people the
Small cell lung cancer (SCLC) continues to carry a poor prognosis with a five-year
survival rate of 3.5% and a 10-year survival rate of 1.8%.
2.3 SYMPTOMS
Lung cancer can cause several symptoms that may indicate a problem in the lungs.
12
Chapter 2 :LUNG CANCER
2.4 PREVENTION
secondhand smoke
air pollution
workplace hazards like chemicals and asbestos.
Early treatment can prevent lung cancer from becoming worse and spreading to other
parts of the body.
Secondary prevention for lung cancer involves screening methods that aim to detect
the disease in its early stages, before symptoms become apparent and can be indicated
for high-risk individuals. In this population, early detection can significantly increase
the chances of successful treatment and improve outcomes. The primary screening
method for lung cancer is low dose computed tomography (LDCT).
2.5 DIAGNOSIS
Diagnostic methods for lung cancer include physical examination, imaging (such as
chest X-rays, computed tomography scans, and magnetic resonance imaging),
examination of the inside of the lung using a bronchoscopy, taking a sample of tissue
(biopsy) for histopathology examination and definition of the specific subtype
(NSCLUNG CANCER versus SCLUNG CANCER), and molecular testing to identify
specific genetic mutations or biomarkers to guide the best treatment option.
13
Chapter 2 :LUNG CANCER
Treatments for lung cancer are based on the type of cancer, how much it has spread,
and the person’s medical history. Early detection of lung cancer can lead to better
treatments and outcomes.
Surgery
radiotherapy (radiation)
chemotherapy
targeted therapy
immunotherapy
Surgery is often used in the early stages of lung cancer if the tumor has not spread to
other areas of the body. Chemotherapy and radiation therapy can help shrink the
tumor.
Doctors from several disciplines often work together to provide treatment and care of
people with lung cancer.
Supportive care is important for people with lung cancer. It aims to manage
symptoms, provide pain relief, and give emotional support. It can help to increase
quality of life for people with lung cancer and their families.
14
Chapter 2 :LUNG CANCER
1-Early-stage disease
The primary treatment for early-stage lung cancer (i.e. tumor limited to the lung, with
no metastatic dissemination to distant organs or lymph nodes) is surgical removal of
the tumor through procedures such as lobectomy, segmentectomy, or wedge resection.
Neoadjuvant therapy (chemotherapy and/or radiation therapy before surgery) can help
reduce tumor size, making it more manageable for surgical removal. Adjuvant
treatment (chemotherapy and/or radiation therapy) is very often recommended after
surgery to reduce the risk of cancer recurrence. In cases where surgery is not feasible,
radiation therapy or stereotactic body radiation therapy (SBRT) may be used as the
primary treatment. Targeted therapy and immunotherapy may also be considered
based on specific tumor characteristics. Individualized treatment plans should be
discussed with healthcare professionals.
2-Advanced disease
The treatment for metastatic stage lung cancer, where the cancer has spread to distant
organs or lymph nodes, is based on various factors, including the patient's overall
health, the extent and location of metastases, histology, genetic profile, and individual
preferences. The primary goal is to prolong survival, alleviate symptoms, and improve
quality of life. Systemic therapies, such as chemotherapy, targeted therapy, and
immunotherapy, play a crucial role in the treatment of metastatic lung cancer.
Chemotherapy is often the first-line treatment for the majority of patients around the
world and involves the use of drugs that circulate throughout the body to kill cancer
cells. Combination chemotherapy regimens are commonly used, and the choice of
drugs depends on factors such as the histological type of the cancer and the patient's
general health conditions. Targeted therapy, designed to block the signaling pathways
that drive the growth of cancer cells, is an important option for patients with specific
genetic mutations or biomarkers identified in their tumor. Immunotherapy,
specifically immune checkpoint inhibitors, has revolutionized the treatment of
metastatic lung cancer. These drugs help to stimulate the immune system to recognize
and attack cancer cells. Local treatments, such as radiation therapy and surgery, may
15
Chapter 2 :LUNG CANCER
Clinical trials like this are crucial as they offer patients access to novel treatments and
help advance medical knowledge, potentially leading to new standard treatment
protocols. Participation in such trials not only provides access to cutting-edge
therapies but also contributes to the broader fight against cancer by supporting the
development of more effective treatments.
Medical imaging tools help radiologists diagnose lung diseases. Among these medical
imaging methods, CT offers more advantages, including size, location,
characterization, and growth of the lesion, which can determine the information of
lung cancer and nodules. 4D CT provides more precise targeting of administered
radiation, which greatly impacts lung cancer management. An automatic detection
system based on linear discriminant analysis (LDA) and optimal deep neural network
(ODNN) has been developed for lung cancer classification in lung CT images. LDA
reduced the extracted image features to reduce the feature dimensions. ODNN is
applied and optimized by modified gravity search algorithm to provide more accurate
classification results. Compared with CT, LDCT is more sensitive for early-stage lung
nodules and detects cancer with lower radiation. However, it does not help reduce
lung cancer deaths.
16
Chapter 2 :LUNG CANCER
MRI is the most powerful tool for lung imaging without ionizing radiation, but it
provides insufficient information with high costs and time-consuming limitations. It
fails to detect approximately 10% of small lung nodules (4-8 mm in diameter). MRI
with ultra-short echo time (UTE) can improve signal intensity and reduce lung
sensitivity. MRI with UTE is sensitive for detecting small lung nodules (4-8 mm).
MRI achieves a higher lung nodule detection rate than LDCT. MRI with different
pulse
sequences have also improved the sensitivity of detecting lung nodules. The authors
investigated T1-weighted and T2-weighted MRI for the detection of small lung
nodules. Compared with a 3T 1.5 MRI, a 1.5T MRI is much easier to identify ground
glass opacities. Ground-glass opacities have been successfully detected in 75% of
people with lung fibrosis who received 1.5 Tesla MRI with SSFP sequences. MRI
with T2-weighted fast spin echo provides similar or better performance for detecting
ground-glass infiltration in immunocompromised subjects.
17
Chapter 2 :LUNG CANCER
18
CHAPTER 3: LITERATURE REVIEW
19
Chapter 3:LITERATURE REVIEW
Several studies have investigated the use of deep learning algorithms for CT-based
lung cancer screening and diagnosis. In general, there are unique image attenuation
patterns in CT images for healthy and unhealthy scans. To distinguish the lungs from
the surrounding tissues, straightforward techniques such as numerical approaches,
gray-level thresholding, and shape-based approaches can be used to perform simple
lung segmentation.
In [1], CNN based model for automatic detection of lung cancer provided lung CT
scan image. We proposed an algorithm known as CNN based Automatic Lung Cancer
Detection (CNN-ALCD) which is based on supervised learning phenomenon. The
learned model is capable of detecting lung cancer from any newly arrived test sample.
The proposed solution has different mechanisms such as preprocessing, building CNN
with different layers, training the CNN model and performing lung cancer detection.
Empirical study revealed that the proposed CNN based model outperforms many
existing neural network-based methods with highest accuracy 94.11%. Therefore, the
proposed system can be integrated with a Clinical Decision Support System (CDSS)
in healthcare units for automatic diagnosis of lung cancer.
20
Chapter 3:LITERATURE REVIEW
In [2], aims to classify malignant and non-malignant cells development in the lungs
using the 2D Convolutional Neural Network (CNN) algorithm to classify the tumors
found in lung as malignant or benign. This method was evaluated on Kaggle CT
scans, experimental results show that our method achieves 88.76% accuracy in
identifying lung nodules from CT images, which is more efficient as compared to
accuracy obtained by the traditional neural network systems.
In [3], some Computed Tomography (CT) images of the Lung Image Database
Consortium (LIDC) dataset are adopted as training and testing data, data
preprocessing is completed by intercepting pixels, normalization and other methods,
data enhancement is realized such as rotation and scaling methods, and the pulmonary
nodule sample library is expanded. Utilizing the constructed lung nodule sample
library, train the Convolutional Neural Network (CNN) model, complete the detection
and segmentation of pulmonary nodules, and exact the regions of pulmonary nodules.
The size and regularity features of pulmonary nodules are extracted, and lung cancer
recognition is realized according to the size and shape of pulmonary nodules. The
experiment results show the lung cancer detection and identification method based on
convolutional neural network with morphological features has higher accuracy.
In [4], In the validating conviction, the enactment of the neural network technique has
been initiated to examine the cancerous growth in the gathered image datasets. With
the help of Artificial intelligence and deep learning technique the cancerous growth
can be evaluated. In accordance to knock back the performance measures the
supervised learning technique is implemented with the use of the deep learning
technique. Convolutional Neural Network the stratagem for tumor detection. The
substructure of this work includes the following constraints such as image acquisition,
image pre-processing, image enhancement, image segmentation, feature extraction,
neural identification. To put it succinctly, machine learning technique gives an
innovational approach to enrich the decision support in lung tumor medicaments at
less cost.
21
Chapter 3:LITERATURE REVIEW
In [5], The use of machine learning is an efficient way to distribute the work of
doctors and process a large amount of data to produce accurate results on the go.
Three phases of CT image pre-processing, Deep Learning, and Convolutional Neural
Network use make up the diagnosis approach. The pre-processing converts raw data
into usable form and deep learning algorithm assigns weight to the data, in the last
stage CNN is used to conclude the health status of the lung, i.e. normal or abnormal.
In [6], combines three types of optimizers with six deep learning models to conduct a
performance comparison. This investigation focuses on six models AlexNet,
GoogleNet, ResNet, Inception V3, EfficientNet b0, and SqueezeNet. The different
models are assessed by comparing their performance with a stochastic gradient with
momentum, Adam, and RMSProp optimization techniques. The study showed that
CPU training takes time for training without GPU support. According to this study,
the google net with Adam as optimizer gives Accuracy-92.08%, Precision-100%,
Recall-86.89%, F1score-92.98%, FPR-0%, FNR-13.11%,outperforming the other
deep learning architectures. When comparing the computational time for deep
learning models, it is observed that Inception V3 takes the most time to train, and
AlexNet takes the least time.
In [7], “Modern deep learning model advancements can be applied to create advanced
computer-aided diagnosis methods to find malignant nodules. The suggested method
classifies nodules seen in CT scan pictures as malignant or benign utilizing a Particle
Swarm Optimization-RNN. The identification and categorization of malignant
nodules has made substantial use of image analysis and neural networks. RNNs are
therefore more suited for the job of classifying and detecting nodules. Additional
characteristics of PSO-RNNs include multiple feature extraction. The suggested PSO-
RNN model, which makes use of the domain expertise of the CT scan pictures of the
lung in the department of medicine and Multilayer Perceptron, will be appropriate for
the early recognition and characterization of CT images including nodules with an
accuracy of 93.52%.”
22
Chapter 3:LITERATURE REVIEW
In [8], a lung nodule detection algorithm based on deep learning. The proposed
method is intended for chest radiography, which has been proved to be an effective
tool for detecting pulmonary nodules in clinical practice. We propose a novel
convolutional neural network (CNN) architecture, which can learn to detect and
classify pulmonary nodules from medical images. Our model obtained the most
advanced results on the lung nodule detection task (lndt). Lndt is a challenging
benchmark data set with high sensitivity and specificity, and has promising
performance on other data sets, including chest X-ray data set (cord). This research is
carried out by using deep learning technology, which is widely used in image
recognition and pattern recognition. Compared with other existing methods, the
proposed method can detect the presence or absence of pulmonary nodules in chest X-
ray images with high accuracy.
In [9], Lung cancer is nothing but abnormal swelling of lung tissues and could be a
life threatening one. As per statistics, it is responsible for more deaths than any other
type of cancer. It is important to identify and treat this anomaly from the patient's
perspective. For identifying the tumour cells from CT scans, numerous image-
processing and soft-computing procedures are used. CT scan images are mostly used
in image processing since these are high quality and clear images with more ppi (pixel
per inch). Using this method, small nodes of tissues (nodules) can be found. In
primary finding of lung cancer, patient's probabilities of existence are increased.
Hence, an effective CAD system for lung tumour detection has been projected. This
system comprises three points: initial level processing, segmentation and
classification of nodules. The study of lung disorders requires accurate segmentation
of lung images which is very important in detecting lung cancer. Lung images contain
noise and weak boundaries so, accurate detection of lung nodule is very difficult or
challenging task. This paper covers the comprehensive review of methods used for
lung nodule detection.
23
Chapter 3:LITERATURE REVIEW
In [10], a deep learning model has been proposed which can perfectly detect and
predict lung cancer levels from histopathological information. The model has been
trained and validated using 15,000 lung cancer histopathological image data and has
got 99.80% prediction accuracy from our model.
In [11], CNN-based approach for the classification of lung cancer and attained
95.62% accuracy. When applied to classifying lung cancer, the solution achieves the
most outstanding performance possible throughout the entire dataset. The overfitting
issue that arises during lung cancer classification tasks may be solved with the help of
the proposed framework, which also outperforms existing methods that are considered
to be state-of-the-art.
In [12], Lung cancer is one of the most common and dangerous cancers in the world.
However, lives can be saved through early diagnosis by CT scan images, which is the
best imaging technique in the medical field for early treatment. Though CT scan
imaging is the best technique, doctors and radiologists face some difficulties such as
not being able to diagnose early and commence treatment and to interpret and identify
cancer from CT scan images because of the limitation of equipment and specialists.
Therefore, to identify cancerous cells accurately, computer-aided diagnosis can be
more helpful for doctors. Computer-aided techniques based on image processing and
machine learning have been extensively researched and are being implemented
currently to address this issue.
In [13], The second leading cause for the exponential increase in the mortality rate
globally is due to lung cancer. Over-consumption of tobacco and cigarettes are the
major reasons. Uncontrollable cell growth in lung region will affect the survival rate
of humans. Manual interpretation of disease prediction might be challenging due to
the exponential increase in medical reports. So, early detection of tumor from proper
manifestation can be done through Computer Aided Diagnosis (CAD) techniques. In
24
Chapter 3:LITERATURE REVIEW
In [15], Lung cancer is a major contributor to global mortality rates and identification
is critical to improve patient outcomes. In recent years, machine learning algorithms
have demonstrated promising results in identifying lung nodules from medical
images. The most compelling area of research for scientists is the early detection of
lung cancer. This study is a method for lung nodule detection using CT images. The
study incorporates a hybrid model that combines multiple machine learning
algorithms including CNN, SVM, DTC, ANN, and KNN to improve the accuracy of
nodule detection. The hybrid model demonstrated high accuracy in identifying
various types of lung nodules, including Adenocell carcinoma, squamous cell
carcinoma, and large cell carcinoma. Specifically, the model achieved an accuracy
rate of over 90% in detecting and differentiating normal lung tissue and Adenocele
25
Chapter 3:LITERATURE REVIEW
carcinomas. Accuracy graphs and priority setting were utilized to assess the model's
capability in accurately predicting the presence of lung cancer. Additionally, the
efficiency of the hybrid model was compared with other machine learning algorithms,
including SVM, Random Forest, and Decision Trees. A large dataset of CT scans was
collected for training and evaluation purposes. The results demonstrated the
advantages of the suggested hybrid model in terms of accuracy and efficiency. This
study highlights the importance of early lung nodule identification using CT scans and
demonstrates the effectiveness of the hybrid model in accurately identifying different
types of lung nodules.
26
Chapter 3:LITERATURE REVIEW
27
Chapter 3:LITERATURE REVIEW
This study included a training set (n = 802) and two validation cohorts (internal, n =
196; external, n = 224). The ACC of deep radiomics algorithm in internal validation
achieved 0.8776, 0.8061 in the 2-category, 3-category classification, respectively.
Even in 8 classifications, the AUC ranged from 0.739 to 0.940 in internal set. Further,
we constructed a prognosis model that C-index was 0.892(95% CI: 0.846–0.937) in
internal validation set.
In [18] This study recruited participants prospectively in two rural sites of western
China. A deep learning system was developed to assist clinicians to identify the
nodules and evaluate the malignancy with state-of-the-art performance assessed by
recall, free-response receiver operating characteristic curve (FROC), accuracy (ACC),
area under the receiver operating characteristic curve (AUC).
This study enrolled 12,360 participants scanned by mobile CT vehicle and detected
9511 (76.95%) patients with pulmonary nodules. Majority of participants were female
(8169, 66.09%), and never-smokers (9784, 79.16%). After 1-year follow-up, 86
patients were diagnosed with lung cancer, with 80 (93.03%) of adenocarcinoma, and
73 (84.88%) at stage I. This deep learning system was developed to detect nodules
(recall of 0.9507; FROC of 0.6470) and stratify the risk (ACC of 0.8696; macro-AUC
of 0.8516) automatically.
28
Chapter 3:LITERATURE REVIEW
CNN-Based Classification CNN for lung cancer High accuracy Needs large dataset,
classification, addresses (95.62%), handles dependent on input
[11]
overfitting overfitting quality
29
Chapter 3:LITERATURE REVIEW
30
Chapter 4:PROPOSED TECHNIQUE
In this chapter, we will describe the technology proposed for our project. The
approach consists of several steps that include pre-processing CT images and using a
convolutional neural network (CNN) to detect and classify lung nodules.
4.2 DATASET
The dataset used for this project includes a collection of CT scan images of lungs.
Each image is labeled with the presence or absence of lung nodules, as well as the
characteristics of any nodules present.
31
Chapter 4:PROPOSED TECHNIQUE
There was very little data available on the disease, so we took the step of data
Augmentation the data to obtain more data
Rotation
Translation
Scaling
Flapping
Shearing
Zooming
Brightness and Contrast Adjustment
Noise Injection
4.3.2 Preprocessing
we made a preprocessed data to make sure that the model will understand
the data.
Resize Images
Channel Ordering
Mean Subtraction
Batching
Normalization
32
Chapter 4:PROPOSED TECHNIQUE
The pre-trained model has been trained on a massive image dataset (ImageNet) and
has learned valuable features for image recognition.
The base layers of the pre-trained MobileNet are frozen. This prevents these layers
from being modified during training and focuses the training process on the final
layers for lung cancer classification.
New classification layers are added on top of the frozen pre-trained MobileNet
architecture. These new layers are specifically designed for binary classification
(cancerous vs. non-cancerous) or multi-class classification (different lung cancer
types).
The entire model, including the frozen pre-trained layers and the newly added
classification layers, is then trained on the preprocessed CT scan dataset. During
training, the model's internal parameters are adjusted to minimize classification errors
on the lung cancer classification task.
(In this section, elaborate on the training process. Specify the optimizer used, loss
function, and any hyperparameter tuning techniques employed. You can mention the
batch size and the number of training epochs here).
4.4.3 Evaluation
The trained model's performance is evaluated on a separate test dataset not used
during training. This ensures an unbiased assessment of the model's generalization
ability.
33
Chapter 4:PROPOSED TECHNIQUE
4.5 DISCUSSION
the MobileNet framework, here are some key components you can include in your
proposed model architecture for lung cancer classification:
On top of the frozen pre-trained MobileNet layers, you'll add new classification layers
specific to your task. These layers will be responsible for learning the patterns that
differentiate cancerous from non-cancerous lung patterns in the CT scans.
The number and type of these layers will depend on whether you're performing binary
classification (cancerous vs. non-cancerous) or multi-class classification (different
lung cancer types).
34
Chapter 4:PROPOSED TECHNIQUE
Common choices for the final layer include a dense layer with a sigmoid activation
for binary classification or a SoftMax activation for multi-class classification.
Additional Considerations:
Pooling Layers: You might consider including pooling layers (e.g., average
pooling) within the pre-trained MobileNet architecture to further reduce the
dimensionality of the data and control overfitting. However, be mindful not to lose
too much spatial information crucial for lung cancer classification.
Batch Normalization: Batch normalization layers can be added after each
convolutional layer to improve the model's training stability and potentially
accelerate convergence.
4.6 COMPARISON
We started to train the model on the data and activating the early stop and check
point tools .
We have used this model and have made some modifications that we will mention in
detail in the next chapter, but we obtained somewhat satisfactory results.
35
Chapter 4:PROPOSED TECHNIQUE
36
Chapter 4:PROPOSED TECHNIQUE
37
4.8 CONCLUSION
This chapter presented a MobileNet-based deep learning model for lung cancer
classification from CT scans. The model leverages transfer learning to exploit the pre-
trained features of MobileNet and fine-tune them for the specific task of lung cancer
detection. The proposed approach offers a computationally efficient and potentially
mobile-friendly solution for lung cancer screening. Future work will involve
exploring techniques to address class imbalance and improve model interpretability
for better clinical adoption.
38
CHAPTER 5: IMPLEMENTATION
39
Chapter 5 : IMPLEMENTATION
5.1 DATA
Data contains 3 chest cancer types which are Adenocarcinoma, Large cell
carcinoma, Squamous cell carcinoma , and 1 folder for the normal cell Data folder
is the main folder that contains all the step folders inside Data folder are test ,
train, valid.
“The precise location and size of a lung tumor can depend on factors such as the
specific subtype of cancer, the stage of the disease, and the patient's unique anatomy.”
1. Normal
Figure 5-4:Normal
Figure CT scan.
5-5 Normal.
40
Chapter 5 : IMPLEMENTATION
2. Adenocarcinoma
41
Chapter 5 : IMPLEMENTATION
Location: Large cell carcinoma can occur anywhere in the lungs and doesn't have
a specific predilection for central or peripheral locations.
Tumor Size Large cell carcinomas are often larger in size compared to other
types of lung cancer. They may present as large, bulky masses on CT scans.
Presence of Cavitation: Cavitation is less common in large cell carcinoma
compared to squamous cell carcinoma, but it can occur, especially in larger
tumors with central necrosis.
Lymph Node Involvement: Large cell carcinoma may involve regional lymph
nodes, but the pattern of lymph node involvement is less predictable compared to
squamous cell carcinoma.
Pattern of Spread: Large cell carcinoma tends to grow rapidly and may spread
early to distant organs such as the brain, bones, or adrenal glands. It can also
spread locally within the chest.
42
Chapter 5 : IMPLEMENTATION
Location Squamous cell carcinoma often arises centrally in the larger bronchi,
though it can also occur peripherally. It tends to grow within the airway, causing
obstruction.
Tumor Size: Squamous cell carcinomas can vary in size, from small nodules to
larger masses. They often present as discrete, solid masses on CT scans.
Presence of Cavitation: Cavitation is relatively common in squamous cell
carcinoma, particularly in larger tumors. Central necrosis can lead to cavitation,
which may be visible on CT scans.
Lymph Node Involvement: Squamous cell carcinoma has a higher propensity for
involving regional lymph nodes, particularly those near the trachea and main
bronchi.
Pattern of Spread: Squamous cell carcinoma typically spreads locally within the
chest, including to adjacent structures such as the chest wall or mediastinum. It
can also metastasize to distant organs.
43
Chapter 5 : IMPLEMENTATION
Data preprocessing is a crucial step in the machine learning pipeline that involves
transforming raw data into a format suitable for training a machine learning model. It
typically includes several steps such as cleaning, transforming, and organizing the
data.
The goal of data augmentation is to increase the diversity of the training dataset,
thereby improving the generalization and robustness of the trained model.
Rotation: Rotate the image from a certain angle, introducing variations in object
orientations.
Translation: Shift the image horizontally or vertically, simulating different object
positions within the frame.
Scaling: Resize the image, making objects appear larger or smaller relative to the
image size.
Flapping: Flip the image horizontally or vertically, creating mirror images.
Shearing: Skew the image along one of its axes, introducing perspective
distortions.
Zooming: Zoom into or out of the image, focusing on specific regions or
capturing a broader context.
Brightness and Contrast Adjustment: Increase or decrease the brightness and
contrast of the image.
Noise Injection: Add random noise to the image, simulating variations in
lighting conditions or sensor noise.
44
Chapter 5 : IMPLEMENTATION
We have used all data augmentation techniques on our project database to obtain the
following:
1. Increase Training Data: Data augmentation expands the effective size of the
training dataset by generating new, modified samples from the existing data. This
is particularly beneficial when the original dataset is limited in size, as it helps
prevent overfitting and improves the generalization ability of the model.
2. Improve Model Robustness: By exposing the model to a wider range of
variations and perturbations in the data, data augmentation encourages the model
to learn features that are more robust and invariant to such changes.
This makes the model more capable of handling variations in real-world data that
it may encounter during deployment.
The main language used in this project is Python. Python’s open-source libraries are
not the only feature that makes it favorable for machine learning and AI tasks. Python
is also highly versatile and flexible, meaning it can also be used alongside other
programming languages when needed. Even further, it can operate on nearly all OS
and platforms on the market.
Implementing Deep Neural Networks can be extremely time consuming, but Python
offers many packages that cut down on this. It is also an object-oriented
programming (OOP) language, which makes it extremely useful for efficient data
use and categorization.
45
Chapter 5 : IMPLEMENTATION
5.4 ACCURACY
Accuracy is a metric that measures how often a machine learning model correctly
predicts the outcome. You can calculate accuracy by dividing the number of correct
predictions by the total number of predictions.
In other words, accuracy answers the question: how often the model is right?
You can measure the accuracy on a scale of 0 to 1 or as a percentage. The higher the
accuracy, the better. You can achieve a perfect accuracy of 1.0 when every prediction
the model makes is correct.
This metric is simple to calculate and understand. Almost everyone has an intuitive
perception of accuracy: a reflection of the model's ability to correctly classify data
points.
46
Chapter 5 : IMPLEMENTATION
Each row of the matrix represents the instances in an actual class while each column
represents the instances in a predicted class, or vice versa – both variants are found in
the literature. The name stems from the fact that it makes it easy to see whether the
system is confusing two classes (i.e. commonly mislabeling one as another).
Given a sample of 12 individuals, 8 that have been diagnosed with cancer and 4 that
are cancer-free, where individuals with cancer belong to class 1 (positive) and non-
cancer individuals belong to class 0 (negative), we can display that data as follows:
47
Chapter 5 : IMPLEMENTATION
5.6.1 Introduction
At the heart of our graduation project lies the ambition to provide healthcare
practitioners with cutting-edge AI algorithms precisely designed to detect cancers
quickly and accurately in human lungs on computed tomography (CT) scans. Through
comprehensive exploration of advanced AI techniques, our goal is to create a robust
system capable of independent classification of a variety of cancers identified on CT
images. By harnessing the power of deep learning methodologies, our ultimate goal is
to redefine diagnostic accuracy and effectiveness in clinical practice, pushing the
boundaries of medical imaging and raising the standard of patient care around the
world.
Figure 5-13: CT scan from Train file. Figure 5-12: CT scan from Validate file. Figure 5-11: CT scan from Test file.
48
Chapter 5 : IMPLEMENTATION
5.6.2 Libraries
WE use Libraries as they provide a set of pre-built functions and tools that simplify
the process of developing and deploying deep learning solutions. By using these
libraries, developers and data scientists can focus more on the problem-solving aspect
rather than spending time on coding complex algorithms from scratch, so here is the
libraries we used in our models.
1. Tensor flow
2. OpenCV
Provides a way to interact with the operating system, such as navigating directories
and handling file paths.
3. NumPy
Fundamental library for numerical computing in Python, providing support for array
operations and mathematical functions.
4. matplotlib
Is a collection of command style functions that make matplotlib work like MATLAB.
Each pyplot function makes some change to a figure: e.g., creates a figure, creates a
plotting area in a figure, plots some lines in a plotting area, decorates the plot with
labels, etc. In matplotlib. Pyplot various states are preserved across function calls, so
that it keeps track of things like the current figure and plotting area, and the plotting
49
Chapter 5 : IMPLEMENTATION
functions are directed to the current axes (please note that “axes” here and in most
places in the documentation refers to the axes part of a figure and not the strict
mathematical term for more than one axis).
5. Seaborn
6. sklearn.metrics
The sklearn.metrics module implements several loss, score, and utility functions to
measure classification performance. Some metrics might require probability estimates
of the positive class, confidence values, or binary decisions values. Most
implementations allow each sample to provide a weighted contribution to the overall
score, through the sample_weight parameter.
7. OS(Operating System)
The os system() method executes the command (a string) in a subshell. This method is
implemented by calling the Standard C function system() with some limitations. If
command generates any output, it is sent to the interpreter standard output stream.
Provides a way to interact with the operating system, such as navigating directories
and handling file paths.
8. keras.callbacks
A callback is an object that can perform actions at various stages of training (e.g. at
the start or end of an epoch, before or after a single batch, etc.). You can use callbacks
to: Write TensorBoard logs after every batch of training to monitor your metrics.
Periodically save your model to disk. Do early stopping.
9. Pandas
Offers high-level data structures and data manipulation tools, particularly useful for
handling tabular data.
50
Chapter 5 : IMPLEMENTATION
5.6.3 VGG16:
51
Chapter 5 : IMPLEMENTATION
5.6.3.1 Preprocessing
Firstly, we made a preprocessing on data to make sure that the model will understand
the data.
1. Resize Images: VGG16 expects input images to have a fixed size. The original
VGG16 model was trained on 224x224 pixel images. Therefore, before feeding
images into the model, you need to resize them to match this size. This can be
done using libraries like OpenCV or PIL (Python Imaging Library).
2. Mean Subtraction: Subtracting the mean pixel value across all images in the
dataset is a common preprocessing step. For VGG16, you would typically subtract
the mean RGB pixel value computed over the entire ImageNet dataset. This helps
center the data around zero and can improve convergence during training.
4. Channel Ordering: Ensure that the input image is in the correct channel ordering
expected by the model. VGG16 expects images to be in the 'RGB' format, where
channels are ordered as Red, Green, Blue.
5. Batching: Prepare the input data in batches for efficient processing. CNNs like
VGG16 often process images in batches to take advantage of parallel processing
capabilities provided by modern hardware.
52
Chapter 5 : IMPLEMENTATION
We removed the last layer and set the trainable = false so the model does not know
anything about the older data and trains on every image alone and gives us the results.
Then we set the dropout rate = 0.5. This means during each training iteration, half of
the neurons in that layer will be randomly deactivated.
We used the SoftMax type activation to predict the class of an input image and we
used optimizer Adam.
53
Chapter 5 : IMPLEMENTATION
4.4.3.2 Performance
1. Train Accuracy
2. Validate
3. Test
4. Confusion Matrix
54
Chapter 5 : IMPLEMENTATION
After checking the accuracy of the model, we found that this model doesn’t fit our
needs, so we tried another model to get better accuracy.
5.6.4 EfficientNet-B0
In the EfficientNet architecture, the input image is first passed through a series of
convolutional layers that reduce the resolution of the image while also increasing the
number of channels. This is followed by a series of bottleneck layers, which are
composed of a depthwise separable convolution followed by a pointwise convolution.
These layers reduce the computational cost of the model while also increasing its
depth.
The output of the bottleneck layers is passed through a series of fully connected layers
that produce the final output of the model. The final output is a vector of probabilities,
one for each class in the dataset, indicating the likelihood that the input image belongs
to each class.
55
Chapter 5 : IMPLEMENTATION
5.6.4.1 Performance
1. Test
2. Confusion Matrix
We have also worked on this model so that we can get better results.
After applying all the previous steps of processing, uploading the data, dividing it, and
running the model, we got better accuracy but it wasn’t good enough as we expected.
56
Chapter 5 : IMPLEMENTATION
5.6.5 MobileNet:
57
Chapter 5 : IMPLEMENTATION
5.6.5.1 Preprocessing
Firstly, we made a preprocessing on our data to make sure that the model will
understand the data.
Resize Images: MobileNet, like most CNNs, expects input images to have a fixed
size. The size can vary depending on the specific MobileNet variant you're using
(e.g., MobileNetV1, MobileNetV2). For example, MobileNetV1 commonly uses
224x224 pixel images, while MobileNetV2 can handle various input sizes.
Mean Subtraction: Similar to VGG16, you may subtract the mean pixel value
across all images in the dataset. However, MobileNet models might have been
trained with different datasets than VGG16, so it's essential to use the appropriate
mean pixel values for MobileNet.
Channel Ordering: Ensure that the input image is in the correct channel ordering
expected by the model. MobileNet typically expects images to be in the 'RGB'
format.
Batching: Prepare the input data in batches for efficient processing. MobileNet,
like VGG16, benefits from processing images in batches for parallel processing.
58
Chapter 5 : IMPLEMENTATION
We removed the last 5 layers, then added dropout rate=0.5 , then we added a layer
called global average pooling 2d, which reduces overfitting, then we put the
outputs of the layers that contain 4 neurons, and the activation function is
SoftMax.
Then we made the last 50 layers in the mobile net did not train the trainable =
false. so that the model does not know anything and trains on all the images alone
and gives us the results.
we used the SoftMax type activation to predict the class of an input image and we
used optimizer Adam.
We prepare a data generator for training a model using images from a directory. The
images are preprocessed using the MobileNet preprocessing function before being fed
into the model.
Train
We classified the data for train into 4 classes each class indicates a type of tumor
while the first class indicates the normal case so know we have (normal = 0,
Validate
Then we classified the validation data as same as trained data but the total number
of the images in the validated data is 1553 belonging to 4 classes.
Test
59
Chapter 5 : IMPLEMENTATION
The tested data is also the same. Classified into 4 types but now the number of images
is 315 images belonging to 4 classes.
1. Importing Libraries
This line imports the `EarlyStopping` callback from the Keras library. Callbacks are
functions that can be applied during the training process of a neural network to
perform certain actions at specific points.
2. Defining Callbacks
3. Check Point
We started to train the model on the data and activating the early stop and check
point tools using 20 epochs.
60
Chapter 5 : IMPLEMENTATION
An epoch is when all the training data is used at once and is defined as the total
number of iterations of all the training data in one cycle for training the machine
learning model.
4.4.5.4 Performance
1. Train
2. Validation
3. Test
4. Confusion Matrix
61
Chapter 5 : IMPLEMENTATION
62
CHAPTER 6: LAYOUT
63
In this chapter we will talk about the layout and the designs we used in our project.
We started our mission by choosing a suitable design then we built the web site and
application.
6.1 UI/UX
Figma is a collaborative web application for interface design, with additional offline
features enabled by desktop applications for macOS and Windows. The feature set of
Figma focuses on user interface and user experience design, with an emphasis on real-
time collaboration, utilizing a variety of vector graphics editor and prototyping tools.
The Figma mobile app for Android and iOS allows viewing and interacting with
Figma prototypes in real-time on mobile and tablet devices.
Firstly, we designed the log in and sign-up page to create the user’s account.
64
Then we create the home page, so the user has multiple choices as scanning his CT
scan or watching the videos of symptoms and causes or treatments and diagnosis.
There are also some hospitals and centers as reference if the user doesn’t know where
to go.
65
Here is the patient history page as the user can see his previous scans and the output
with the dates so he can see if there is progress or not.
66
There are many other features, but we will discuss them later. Now we need to
convert this design into a real platform able to be used by a user.
67
Figure 6-22 Diagnosis & Treatment Page
68
Chapter 6: LAYOUT
69
Chapter 6: LAYOUT
6.3 APPLICATION
70
Chapter 6: LAYOUT
71
72
Chapter 7: USED SOTWARE
7.1 INTRODUCTION
In this chapter, we will display all the software products used in this project, and we
will also clarify some points specifically for evaluating the model.
73
Chapter 7: USED SOTWARE
With zero configuration needed, Colab enables Python coding directly in the browser,
supports easy sharing of notebooks stored in Google Drive, and seamlessly integrates
executable code with rich text, images, HTML, and LaTeX. Utilizing popular Python
libraries such as NumPy and matplotlib, Colab empowers users to analyze and
visualize data efficiently.
74
Chapter 7: USED SOTWARE
7.2.2 Kaggle
Kaggle is an online community and platform tailored for data scientists and AI
enthusiasts, offering collaborative features, dataset publishing, GPU-integrated
notebooks, and competitive challenges. Founded in 2010 by Anthony Gold bloom and
Jeremy Howard and later acquired by Google in 2017, Kaggle aims to empower
professionals and learners in their data science journey by providing robust tools and
resources.
Users can engage in contests hosted by major companies, share and explore datasets,
exchange code snippets, and participate in discussions. Additionally, Kaggle offers
free courses with certificates upon successful completion, making it an inclusive hub
for knowledge-sharing and skill development in the fields of data science and
artificial intelligence.
75
Chapter 7: USED SOTWARE
Android Studio is the official integrated development environment (IDE) for Google's
Android operating system, built on JetBrains' IntelliJ IDEA software and designed
specifically for Android development. It is available for download on Windows,
macOS and Linux based operating systems. It is a replacement for the Eclipse
Android Development Tools (E-ADT) as the primary IDE for native Android
application development. Android Studio is licensed under the Apache license but it
ships with some SDK updates that are under a non-free license, making it not open
source.
Android Studio was announced on May 16, 2013, at the Google I/O conference. It
was in early access preview stage starting from version 0.1 in May 2013, then entered
beta stage starting from version 0.8 which was released in June 2014.The first stable
build was released in December 2014, starting from version 1.0.At the end of 2015,
Google dropped support for Eclipse ADT, making Android Studio the only officially
supported IDE for Android development.
On May 7, 2019, Kotlin replaced Java as Google's preferred language for Android app
development. Java is still supported, as is C++.
76
Chapter 7: USED SOTWARE
Fast API is a modern, fast (high-performance), web framework for building APIs with
Python based on standard Python type hints.
Fast: Very high performance, on par with NodeJS and Go. One of the fastest Python
frameworks available.
Fast to code: Increase the speed to develop features by about 200% to 300%. *
Easy: Designed to be easy to use and learn. Less time reading docs.
Short: Minimize code duplication. Multiple features from each parameter declaration.
Fewer bugs.
Standards-based: Based on (and fully compatible with) the open standards for APIs:
Open API (previously known as Swagger) and JSON Schema.
77
CHAPTER 8: CONCLUSION
78
Chapter 8 :CONCLUSION
Introduction: This section discusses the conclusions of this project in relation to the
space of the detection model design and development as a whole, as well as
applications and possible future work.
8.1 CONCLUSION
In conclusion, the integration of artificial intelligence (AI) into lung cancer detection
through deep learning models represents a significant advancement in medical
imaging and diagnostics. The capability of AI to analyze and interpret low dose
computed tomography (LDCT) images with high precision offers a promising tool to
enhance early detection of lung cancer, which is critical for improving patient
outcomes. By leveraging complex algorithms and superior pattern recognition
abilities, AI surpasses human limitations in identifying subtle anomalies in chest
scans, thereby potentially increasing the accuracy and efficiency of lung cancer
screening programs.
"Health Lung" explores the technical development and real-world application of such
models, highlighting the transformative potential of AI in healthcare. This approach
not only aids clinicians in making more informed decisions but also supports
personalized patient care through accurate and timely diagnosis.
Overall, the adoption of deep learning models in lung cancer detection underscores a
pivotal shift towards more intelligent, data-driven medical practices, heralding a new
era in the fight against lung cancer. This project exemplifies the vital role of AI in
advancing medical technology and improving patient care outcomes.
In summary, the application of deep learning to lung cancer detection exemplifies the
transformative potential of AI in healthcare. It not only enhances the diagnostic
process but also significantly contributes to improving patient outcomes by enabling
earlier detection and intervention. The continued advancement and integration of AI
technologies in medical practice are poised to revolutionize the field, making
79
Chapter 8 :CONCLUSION
Each approach has its strengths and weaknesses, making them suitable for different
applications and contexts in lung cancer detection. The choice of approach would
depend on the specific requirements, available data, and computational resources.
80
Chapter 8 :CONCLUSION
Chatbot: Chatbots are conversational tools that perform routine tasks efficiently.
People like them because they help them get through those tasks quickly so they can
focus their attention on high-level, strategic, and engaging activities that require
human capabilities that cannot be replicated by machines.so we want to add chatbot
specially a type called hybrid chatbot. A hybrid chatbot is a harmonious blend of
chatbot and live chat that combines the best of both worlds. A customer service
representative will be available in live chat to answer any customer’s questions, which
may be too complex or nuanced for automation alone.
Reservations: we also need to add a feature that allows the user to reserve an
appointment so if the scans were positive the user can reserve his appointment in the
hospital or the clink from the web site.
Improving: The main goal of this project is to help in the medical field and help the
patients to have the most accurate results. we want to improve the accuracy of the
model so we can get more accurate results, but now this depends on the data. There is
a problem with lung cancer CT scans. The problem is it’s very hard to get data from
online search engines, so we need to contract with a specialized hospital in lung
cancer treatment . After taking this data from the hospital, we can train our model on
more data so the model will be more accurate.
Otherwise, we can add some new features as using computer vision techniques or
NLP in the near future.
81
CHAPTER 9: REFERENCE
82
Chapter 9 :REFERENCE
[3] Y. Zhang, B. Dai, M. Dong, H. Chen and M. Zhou, "A Lung Cancer Detection
and Recognition Method Combining Convolutional Neural Network and
Morphological Features", IEEE 5th International Conference on Computer and
Communication Engineering Technology (CCET), Aug. 2022, [online] Available:
https://doi.org/10.1109/ccet55412.2022.9906329.
[6] N. Vijayan and J. Kuruvilla, "The impact of transfer learning on lung cancer
detection using various deep neural network architectures", 2022 IEEE 19th India
Council International Conference (INDICON), Nov. 2022.
83
Chapter 9 :REFERENCE
[8] Zhao, "Lung Nodule Detection Algorithm Based on Deep Learning In Medical
Images", International Conference on Artificial Intelligence of Things and
Crowdsensing (AIoTCs), Oct. 2022, [online] Available:
https://doi.org/10.1109/aiotcs58181.2022.00118.
84
Chapter 9 :REFERENCE
[14] T. Tan et al., "Optimize transfer learning for lung diseases in bronchoscopy
using a new concept: Sequential Fine-Tuning", IEEE Journal of Translational
Engineering in Health and Medicine, vol. 6, pp. 1-8, Jan. 2018.
[17] Wang C., Shao J., Lv J., Cao Y., Zhu C., Li J., Shen W., Shi L., Liu D., Li
W. Deep learning for predicting subtype classification and survival of lung
adenocarcinoma on computed tomography. Transl. Oncol. 2021;14:101141.
doi: 10.1016/j.tranon.2021.101141.
[18] Shao J., Wang G., Yi L., Wang C., Lan T., Xu X., Guo J., Deng T., Liu D., Chen
B., et al. Deep learning empowers lung cancer screening based on mobile low-dose
computed tomography in resource-constrained sites. Front. Biosci.
Landmark. 2022;27:212.
85