9.biomedical Image Analysis For Colon and Lung Cancer Detection Using Tuna Swarm Algorithm With Deep Learning Model
9.biomedical Image Analysis For Colon and Lung Cancer Detection Using Tuna Swarm Algorithm With Deep Learning Model
ABSTRACT The domain of Artificial Intelligence (AI) is made important strides recently, leading to devel-
opments in several domains comprising biomedical diagnostics and research. The procedure of AI-based
systems in biomedical analytics takes opened up novel avenues for the progress of disease analysis, drug
discovery, and treatment. Cancer is the second major reason of death worldwide; around one in every six peo-
ple pass away suffering from it. Among several kinds of cancers, the colon and lung variations are the most
frequent and deadliest ones. Initial detection of conditions on both fronts significantly reduces the probability
of mortality. Deep learning (DL) and Machine learning (ML) systems are exploited to speed up such cancer
detection, permitting researchers to analyze a huge count of patients in a lesser time count and at a minimal
cost. This study develops a new Biomedical Image Analysis for Colon and Lung Cancer Detection using
Tuna Swarm Algorithm with Deep Learning (BICLCD-TSADL) model. The presented BICLCD-TSADL
technique examines the biomedical images for the identification and classification of colon and lung cancer.
To accomplish this, the BICLCD-TSADL technique applies Gabor filtering (GF) to preprocess the input
images. In addition, the BICLCD-TSADL technique employs a GhostNet feature extractor to create a
collection of feature vectors. Moreover, AFAO was executed to adjust the hyperparameters of the GhostNet
technique. Furthermore, the TSA with echo state network (ESN) classifier is utilized for detecting lung and
colon cancer. To demonstrate the more incredible outcome of the BICLCD-TSADL system, an extensive
experimental outcome is carried out. The comprehensive comparative analysis highlighted the greater
efficiency of the BICLCD-TSADL technique with other approaches with maximum accuracy of 99.33%.
INDEX TERMS Cancer, biomedical imaging, artificial intelligence, colon cancer, tuna swarm algorithm,
GhostNet.
and it can be considered that an unaware diet leads to the This study develops a new Biomedical Image Anal-
growth of colon cancer [3]. Thus, lung tumour is the second ysis for Colon and Lung Cancer Detection using Tuna
cancer with colon cancer. In simple, a patient can affect by Swarm Algorithm with Deep Learning (BICLCD-TSADL)
both colon and lung cancer simultaneously. Therefore, it is model. The BICLCD-TSADL technique applies Gabor fil-
dynamic to examine both cancer types in patients and to tering (GF) to preprocess the input images. In addition,
detect them in advance [4]. the BICLCD-TSADL technique employs a GhostNet feature
The common symptoms are muscle pain, fatigue, cough, extractor to make a collection of feature vectors. Moreover,
etc., follow by various kinds of syndromes [5]. Some of AFAO can be executed to adjust the hyperparameters of
the frequently used radiographic imaging methods are ultra- the GhostNet technique. Furthermore, the TSA with echo
sound, mammography, histopathological imaging, computed state network (ESN) classifier is utilized for detecting lung
tomography (CT), magnetic resonance imaging (MRI), and and colon cancer. To demonstrate the greater efficiency of
positron emission tomography (PET) for cancer detection [6]. the BICLCD-TSADL algorithm, an extensive experimental
Of these, histopathology images comprising phenotypic data outcome is carried out. In short, the key contributions of the
are vital for the evaluation and diagnosis of cancer diseases. study are listed as follows.
Manual examination of these medical images by profession- • Develop an automated colon and lung cancer detection
als is a difficult and delicate task. Hence, it also necessitates model, comprising GF preprocessing, GhostNet feature
a strong focus and time-consuming task [7]. Likewise, the extraction, AFAO-based hyperparameter tuning, ESN
recognitionofcasesismoredifficultintheevent of initial iden- classification, and TSA-based parameter optimization.
tification; the symptoms are difficult and vague to identify. To the best of our knowledge, the BICLCD-TSADL
Likewise, identifying tumours consumes a lot of time and technique never existed in the literature.
is dependent on different opinions of physicians in the initial • Employ AFAO with the GhostNet model for the feature
stages [8]. A distinct domain of healthcare can sort out these extraction process, which contributes to the accurate
difficulties. Artificial intelligence (AI) methods are utilized in representation of colon and lung cancer image data.
the medical domain, like early identification of health disas- • Employ the ESN model to effectively learn and classify
ters, biomedical image, and disease forecast [9]. DL methods cancerous and non-cancerous patterns in colon and lung
are highly capable of analyzing data from anatomical repre- cancer images, contributing to accurate cancer detection.
sentations, high-dimensional images, and videos. Likewise, • Hyperparameter optimization process using AFAO and
DL methods extract hidden characteristics and features from TSA helps to improve the cancer detection performance
medical images that are invisible to the naked eye for the of the BICLCD-TSADL model.
initial identification of cancers and discrimination among
their phases [10]. In this study, due to the same features of II. RELATED WORKS
abnormal cells in the initial phases, numerous hybrid sys- In [11], CNN methods were used to examine imaging data of
tems have been established with extraction features by mixed colon cells. CNN with average pooling and max pooling layers
techniques. and MobileNetV2 methods have been applied for colon cell
Multi-level hyperparameters, also called hierarchical imageclassification.Wahid et al. [12]introducedaCADmech-
hyperparameters, can be employed for controlling the config- anism through the CNN to find colon, lung and cancer tissues.
uration and behavior of complex DL models. These hyper- Heretheauthor,namelyResNet18,ShuffleNet V2,andGoogle
parameters are organized in a hierarchical manner, where Net,inadditiontoonesimplecustomizedCNNmodel,usesthree
higher-level hyperparameters affects the values or ranges of pre-trained CNN models. Kumar et al. [13] devise 4 CNN,
lower-level hyperparameters. It offers a structured way of namely,3-block,baseline,3-block,and2-blockCNNswithdata
managing and optimizing the settings for each component, augmentation, for classifying colon tissue histopathological
making it easier to maintain, understand, and tune the over- images(HPI).TofourCNN,theHPIwasfedasinput.
all model. When optimizing hyperparameters for a feature Garg and Garg [14] purposes of using and altering the
extractor and classifier, it is important to consider the hier- present pre-training CNN-related technique for finding colon
archical relationship between these components. The feature and lung cancer with the help of HPI, including bet-
extractor is responsible for transforming the input data into ter augmentation approaches. Here, 8 various pre-trained
a meaningful feature representation, while the classifier uses CNN methods like MobileNet, InceptionResNetV2, VGG16,
these features to make predictions. The learning rate is a crit- InceptionV3, DenseNet169, ResNet50, NASNetMobile, and
ical hyperparameter in many feature extractor architectures, Xception were trained on LC25000 data. Adu et al. [15]
such as neural networks. It controls the step size at which present a novel DHS-CapsNet abbreviated as dual horizontal
the model updates its internal parameters during training. For squash capsule network for categorizing the colon and lung
classifiers like neural networks, the number of hidden units in cancers on HPI. DHS-CapsNet is a new horizontal squash
the classifier’s layers significantly impacts the model’s capac- (HSquash) function develop for encoder feature fusion (EFF).
ity to learn complex patterns. Tuning this hyperparameter While a squash function, HSquash is modelled to make sure
using random search or Bayesian optimization can help that vectors can be effectually squashed and makes sparsity
achieve the right balance between model complexity and for higher discriminative capsules for extracting significant
generalization. data from images with different backgrounds.
94706 VOLUME 11, 2023
M. Obayya et al.: BICLCD-TSADL Model
α1 · Xbest
t t
− Xit +α2 ·Xit ,
+β · Xbest
tuna are performed based on the probability allocation, and
i = 1,
X1t+1 = (12) if the selection probability for the two foraging techniques
α t t i
− Xit +α2 ·Xt−1 ,
1 · X best +β · Xbest is 1/2, then they are concurrently performed, and it can be
i = 2, 3, . . . , NP,
mathematically formulated as follows.
t
α1 = ∂ + (1 − a) · , −X1ι )+TF · 2 ·Xbest
t t t
(13) X +(rand ·Xbest −Xit ),
tmax best
T
ifrand< 0.5,
α2 = (1 − a) − (1 − a) · , X1t+1 =
tmax
(14)
TF · p2
· X1ι ,
β = ebl · cos (2πb) ifrsnd≥ 0.5,
(15)
l = e3cos(tmax +1/t)−1)π) (16) (19)
(t/tmax )
t
where best denotes the existing better individual (food), Xiι+1 p= 1− , (20)
tmax
denotes the i − th individuals of the t + 1 iteration, a indicates
the constant, defines to which extent the tuna follows the where TF denotes the random integer within [1, −1].
better individual and the prior individual at the initial phase, The fitness chosen is a vital feature in the TSA approach.
t and tmax indicates the existing and the maximum amount An encoder solution can be utilised to grow the ability of
of iterations, α1 and α2 denotes the weight coefficient which candidate solutions. Here, the accuracy value is the critical
controls the movement trends of the individual to the better factor utilized to propose a fitness function.
and the prior individuals, and b is a uniformly distributed
Fitness = max (P) (21)
random integer within [0,1]. Once the best individual could
TP
not find food, blindly following the optimum individual for- P= (22)
aging is not advantageous to group foraging. Hence, to assist TP + FP
every individual in having the best spatial search abilities, whereas FP signifies the false positive and TP denotes the
a reference point for spiral search should be given to produce true positive value.
a random coordinate in the search range, thereby allowing
TSA to have the best global exploration abilities, and it can IV. EXPERIMENTAL VALIDATION
be a mathematical equation by Eq. (17): The proposed model is simulated using Python 3.6.5 tool on
PC i5-8600k, GeForce 1050Ti 4GB, 16GB RAM, 250GB
α1 · Xrand
t t
−Xit +α2 ·Xit ,
+β · Xrand SSD, and 1TB HDD. The parameter settings are given as
t=1′
i = 1, follows: learning rate: 0.01, dropout: 0.5, batch size: 5, epoch
X1 = (17)
α t t
−Xit +α2 ·Xi−1
t
,
1 · Xrand +β · Xrand count: 50, and activation: ReLU.
i = 2, 3, . . . , NP The experimental validation of the BICLCD-TSADL
algorithm was tested on the LC25000 database [26] com-
t
where Xrand denotes the random reference point in the prising five classes with 5000 samples under each class,
search ranges, TSA is typically explored extensively globally as depicted in Table 1. Fig. 3 depicts the sample of Lung
at an early stage and then slowly transitioned to accurate and colon images. The 5 classes are colon adenocarcino-
local exploitation. Thus, with the increasing amount of iter- mas, benign colonic tissues, lung adenocarcinomas, lung
ations, TSA slowly changes the reference point of spiral squamous cell carcinomas and benign lung tissues. For
foraging from a random individual at the beginning to an experimental validation, 70:30 of training/testing data.
optimum individual. The spiral foraging strategy can be Fig. 4 establishes the classifier outcomes of the
mathematically expressed by Eq. (18), as shown at the bottom BICLCD-TSADL system under the test database. Figs. 4a-4b
of the next page. represents the confusion matrix offered by the
Tuna chooses to spiral foraging along with parabolic coop- BICLCD-TSADL model on 70:30 of TRP/TSP. The figure
erative foraging. Tuna forms a parabola with the targeted food denoted that the BICLCD-TSADL approach has identified
as a reference to the Z -point. Tuna finds the targeted food by and classified all 5 class labels accurately. Likewise, Fig. 4c
searching around the parabola. Both foraging approaches of represents the PR investigation of the BICLCD-TSADL
(
α1 · t
+ β · Xbest
t
− Xit + α2 · Xit ,
Xbest i = 1, t
if rand ≥
α1 · t
+ β · Xbest
t
− Xit + α2 ·Xi−1
t
, i = 2, 3, . . . , NP,
Xbest tmax
X1t+1 ′ = ( (18)
α1 · t
+ β · Xrand
t
− Xit + α2 · Xit ,
Xrand i = 1, t
if rand <
α1 · Xrand + β · Xrand − Xit + α2 ·Xi−1 , i = 2, 3, . . . , NP,
t t
t tmax