Peerj Cs 10 2008
Peerj Cs 10 2008
ABSTRACT
Brain tumors present a significant medical challenge, demanding accurate and timely
diagnosis for effective treatment planning. These tumors disrupt normal brain functions
in various ways, giving rise to a broad spectrum of physical, cognitive, and emotional
challenges. The daily increase in mortality rates attributed to brain tumors underscores
the urgency of this issue. In recent years, advanced medical imaging techniques,
particularly magnetic resonance imaging (MRI), have emerged as indispensable tools
for diagnosing brain tumors. Brain MRI scans provide high-resolution, non-invasive
visualization of brain structures, facilitating the precise detection of abnormalities
such as tumors. This study aims to propose an effective neural network approach
for the timely diagnosis of brain tumors. Our experiments utilized a multi-class
MRI image dataset comprising 21,672 images related to glioma tumors, meningioma
tumors, and pituitary tumors. We introduced a novel neural network-based feature
engineering approach, combining 2D convolutional neural network (2DCNN) and
VGG16. The resulting 2DCNN-VGG16 network (CVG-Net) extracted spatial features
from MRI images using 2DCNN and VGG16 without human intervention. The newly
created hybrid feature set is then input into machine learning models to diagnose
Submitted 7 November 2023 brain tumors. We have balanced the multi-class MRI image features data using the
Accepted 1 April 2024 Synthetic Minority Over-sampling Technique (SMOTE) approach. Extensive research
Published 17 May 2024 experiments demonstrate that utilizing the proposed CVG-Net, the k-neighbors
Corresponding author classifier outperformed state-of-the-art studies with a k-fold accuracy performance
Amjad Rehman, [email protected] score of 0.96. We also applied hyperparameter tuning to enhance performance for
Academic editor multi-class brain tumor diagnosis. Our novel proposed approach has the potential
Shibiao Wan to revolutionize early brain tumor diagnosis, providing medical professionals with a
Additional Information and cost-effective and timely diagnostic mechanism.
Declarations can be found on
page 20
DOI 10.7717/peerj-cs.2008 Subjects Algorithms and Analysis of Algorithms, Artificial Intelligence, Computer Vision, Data
Mining and Machine Learning
Copyright Keywords Machine learning, Deep learning, Brain tunmor, Image processing, Transfer learning,
2024 Al-Otaibi et al.
Magnetic resonance imaging
Distributed under
Creative Commons CC-BY 4.0
OPEN ACCESS
How to cite this article Al-Otaibi S, Rehman A, Raza A, Alyami J, Saba T. 2024. CVG-Net: novel transfer learning based deep features for
diagnosis of brain tumors using MRI scans. PeerJ Comput. Sci. 10:e2008 http://doi.org/10.7717/peerj-cs.2008
INTRODUCTION
Brain tumors are abnormal growths of cells within the brain that can disrupt normal brain
function and pose serious health risks (Miao et al., 2023). Detecting and diagnosing brain
tumors is paramount for timely intervention and treatment. MRI brain scans have become
a crucial tool (Zhu et al., 2023). MRI scans utilize powerful magnets and radio waves to
generate detailed images of the brain structure, allowing healthcare professionals to visualize
abnormalities (Liu et al., 2023b). The MRI scans are especially effective in identifying brain
tumors because they provide high-resolution images that can differentiate between healthy
brain tissue and tumor masses.
Brain tumors can be classified into various types, each with its distinct characteristics
and potential challenges in diagnosis and treatment (Mehnatkesh et al., 2023; Liu et al.,
2023a). Glioma tumors originate in the glial cells, which provide support and protection
to neurons (Oztek et al., 2023). They are often classified according to their aggressiveness,
with high-grade gliomas being particularly concerning due to their rapid growth and
infiltrative nature. On the other hand, meningioma tumors develop from the meninges,
the protective membranes surrounding the brain and spinal cord. While often benign, they
can cause symptoms by compressing nearby brain structures (Bhatele & Bhadauria, 2023).
Pituitary tumors, which arise in the pituitary gland located at the base of the brain, can
affect hormone production and regulation, leading to a wide range of health issues.
The mortality rates caused by brain tumors are increasing daily. Between 2014 and
2018, the average annual age-adjusted incidence rate for primary malignant brain tumors
was 7.06 per 100,000 individuals. Projections indicated that in 2022, an estimated 25,050
new diagnoses and 18,280 fatalities would be attributed to primary malignant brain
tumors (Thierheimer et al., 2023). Brain tumors were 24.25 per 100,000 individuals,
comprising 7.06 per 100,000 for malignant cases and 17.18 per 100,000 for non-malignant
cases (Li et al., 2023a). Similarly, based on the 2018 data from the Global Cancer Registry,
a total of 18,078,957 cancer cases were reported across all genders, with 29,681 cases
attributed to brain cancer (Khazaei et al., 2020).
Traditional diagnostic methods for brain tumors have primarily relied on radiological
imaging techniques such as computed tomography (CT) (Zhang et al., 2023) and MRI.
Classical treatment modalities for brain tumors encompass surgical resection, radiation
therapy, and chemotherapy, with the specific approach chosen depending on tumor type,
location, and patient characteristics. Deep learning based methods use MRI images analyzed
using convolutional neural networks (CNN) for brain tumor recognition (Rajinikanth,
Kadry & Nam, 2021; Rajinikanth et al., 2022; Maqsood, Damasevicius & Shah, 2021;
Maqsood, Damaševičius & Maskeliūnas, 2022; Badjie & Deniz Ülker, 2022; Kurdi et al.,
2023; Khan et al., 2023).
In this study, we propose an advanced image-processing mechanism that effectively
diagnoses brain tumors without human intervention. We introduce a novel neural network,
CVG-Net, for transfer learning-based spatial feature extraction from MRI images. These
newly created transfer features are then employed to diagnose brain tumors using machine
learning models.
LITERATURE ANALYSIS
This literature analysis section comprehensively examines existing studies and
methodologies related to both neural network applications in medical imaging and
transfer learning techniques. It delves into the foundational work in medical image
analysis, emphasizing the pivotal role of neural networks in automating the detection and
diagnosis of various pathologies, including brain tumors. Additionally, it synthesizes the
key findings and methodologies from relevant studies, providing a robust foundation for
the subsequent implementation of transfer learning techniques in the proposed diagnosis
framework for brain tumors using MRI scans
PROPOSED METHODOLOGY
This section comprehensively describes the materials and methods used to evaluate our
research experiments. We examine the MRI scans dataset and the advanced neural network
approaches applied. Figure 1 illustrates our novel proposed research methodology for
diagnosing brain tumors. The following steps outline the process carried out in our
proposed methodology for conducting experiments.
• Step 1: At the outset of our research experiments, we acquired a brain tumor-related
dataset of MRI scan images for evaluating results. This dataset encompasses multiple
target classes, allowing for a comprehensive assessment of our findings.
2D-CNN
Features
Splitting
VGG-16
Multi Class Brain Tumors Novel Transfer Learning SMOTE Based Features
Feature Engineering Data Balancing Training
MRI Image Scans
Testing
Normal
Glioma_tumor
Meningioma_tumor
Pituitary_tumor
Diagnosis of Brain Tumors Model Evaluations Hyper-Parameterized Artificial
Using MRI Scans & Predictions Intelligence Approach
Figure 1 Workflow diagram of our proposed methodology for brain tumor diagnosis.
Full-size DOI: 10.7717/peerjcs.2008/fig-1
• Step 2: Next, for the extraction of rich-level features from MRI images, we have
proposed a novel neural network approach that leverages transfer learning for spatial
feature extraction, ultimately creating a new feature set.
• Step 3: We then analyzed the dataset and identified an imbalance. To address this issue,
we applied the SMOTE approach to balance the features of the dataset for multi-class
classification.
• Step 4: The image feature data is then divided into two portions: training and testing.
The split ratio employed is 80% for training and 20% for testing.
• Step 5: We have employed advanced machine learning and deep learning approaches
in this step. Using 80% of the training data, we have implemented the applied methods.
• Step 6: Then, the 20% portion of the data is utilized for the evaluations and predictions
of applied methods through various performance metrics in comparison.
• Step 7: Ultimately, the superior neural network method during experiments is employed
for brain tumor diagnosis.
target class (3) contains 5,908 samples. Additionally, we conducted a sample image analysis,
as shown in Fig. 3.
We utilized the state-of-the-art dataset: Multi-class Brain Tumour MRI Image Data,
comprising 21,672 MRI scan images. We chose this dataset because it aligns with the
problem addressed in our research study, specifically, brain tumor detection. Moreover,
this dataset encompasses a substantial number of images, enhancing the generalizability of
our research findings.
Hybrid Spatial
Input:(128, 128, 3) SoftMax Features
Conv2D Flatten (21672, 400)
MaxPooling2D
Layer
Repeated 5 Times
Figure 4 The architectural workflow analysis of our proposed CVG-Net feature engineering.
Full-size DOI: 10.7717/peerjcs.2008/fig-4
relative to others, can significantly hinder the performance of machine learning models,
as they tend to be biased towards the majority class. In medical imaging, this can have
critical implications for patient care. SMOTE operates by generating synthetic samples in
the feature space of the minority class, thereby alleviating the imbalance. When applied
to multi-class MRI datasets for brain tumor diagnosis, SMOTE ensures that the model
is exposed to a more representative data distribution, effectively discriminating between
various tumor types. This methodology has demonstrated promising results, contributing
to more accurate and reliable diagnoses, paramount in clinical decision-making processes.
Hyperparameter setting
Hyperparameter tuning is a crucial aspect of optimizing neural network models in
machine learning. These hyperparameters are configurations external to the model and can
significantly impact its performance. In our research, we focus on systematically exploring
and optimizing these hyperparameters to enhance the overall efficiency and effectiveness of
neural networks. We employ a recursive process of training and evaluations to determine
the best-fit hyperparameters. The selected hyperparameters for the neural network applied
to diagnose brain tumors are presented in Table 4.
Experimental setup
Our proposed experimental study setup includes the use of the Google Colab platform to
evaluate research results. The platform offers GPU backend support, providing 90 GB of
disk space and 13 GB of RAM. We have employed the Python 3.0 programming language
to build the neural network used in our research. We utilized the Sklearn, Tensorflow, and
Keras modules for evaluating the results. During performance evaluations, true positives
(TP), true negatives (TN), false positives (FP), and false negatives (FN) are used as basic
parameters. The performance metrics used for performance evaluation are examined as
follows:
• Accuracy: in diagnosing brain tumors using brain MRI scans refers to measuring how
effectively the diagnostic system or method correctly identifies and classifies the presence
or absence of brain tumors in the MRI images.
TP + TN
Accuracy = . (1)
TP + TN + FP + FN
• Precision: in diagnosing brain tumors using brain MRI scans refers to measuring how
accurately the diagnostic test identifies TP cases among all the positive cases it detects.
TP
Precision = . (2)
TP + FP
• Recall: in diagnosing brain tumors using brain MRI scans, is a fundamental performance
metric that measures the ability of a diagnostic model or process to correctly identify
Figure 5. The time series-based performance results comparison of CNN model during training. Each
Figure 5 The time series-based performance results comparison of CNN model during training. Each
performance
performancemeasure
measureisisevaluated
evaluated during
during the
the 20 epochsof
20 epochs oftraining.
training.
Full-size DOI: 10.7717/peerjcs.2008/fig-5
283 4.2 Performance analysis with classical approaches
284 The performance analysis of the applied classical deep learning-based approaches, 2DCNN and VGG16,
285 is Table 5 The
conducted inresults analysis
this section. of applied
MRI 2D-CNN
brain scan neural
images network.
are fed into the 2DCNN and VGG16 methods, and
286 the results are then evaluated.
Accuracy Loss Target class Precision Recall F1
287 4.2.1 Results of CNN Model Normal 0.72 0.76 0.74
288 The performance results of the applied CNN model during training
Glioma_tumor 0.70 are illustrated 0.68in Figure 5.0.69
We
289 conducted
0.76 a time series analysis over 20 epochs of the CNN model. The evaluation is based on loss scores,
3.64 Meningioma_tumor 0.78 0.75 0.77
290 accuracy, precision, and recall. The analysis reveals that the training loss is very high in the first two
291
Pituitary_tumor 0.87 0.87
epochs. Subsequently, there is a notable reduction in loss, while the validation loss gradually increases. 0.87
292 Similarly, the training accuracy starts off low in the initial two epochs, but then exhibits a steady increase.
Average 0.77 0.76 0.77
293 The validation accuracy scores fluctuate around 0.80. The precision and recall metrics exhibit a similar
294 pattern to the accuracy
Subsequently, thereperformance
is a notablescores throughout
reduction in training. In conclusion,
loss, while this analysis
the validation indicates that
loss gradually
295 the CNN model achieved acceptable scores for brain tumor diagnosis during training. Nevertheless, there
296
increases.
is still a needSimilarly, the training
for performance accuracy starts off low in the initial two epochs, but
enhancement.
297 then The performance
exhibits analysis
a steady of the The
increase. applied CNN model
validation for unseen
accuracy testing
scores data is around
fluctuate described0.92.
in Table
The
298 5. The analysis shows that the CNN model achieved an acceptable score of 0.76
precision and recall metrics exhibit a similar pattern to the accuracy performance scores during brain tumor
299 diagnosis. However, the loss scores are high for the testing data. The class-wise performance scores also
11/19
PeerJ Comput. Sci. reviewing PDF | (CS-2023:10:92458:1:1:NEW 18 Jan 2024)
Al-Otaibi et al. (2024), PeerJ Comput. Sci., DOI 10.7717/peerj-cs.2008 14/24
Normal 0.72 0.76 0.74
Glioma tumor 0.70 0.68 0.69
0.76 3.64 Meningioma tumor 0.78 0.75 0.77
Pituitary tumor 0.87 0.87 0.87
Average 0.77 0.76 0.77
Figure
Figure6.6 The
Thetime
timeseries-based performanceresults
series-based performance resultscomparison
comparisonofof VGG16
VGG16 model
model during
during training. Each
training.
performance measure
Each performance is evaluated
measure during
is evaluated the 20
during the epochs ofoftraining.
20 epochs training.
Full-size DOI: 10.7717/peerjcs.2008/fig-6
Table 7 The results analysis of applied neural networks with novel unbalanced features.
KNC LR
1200
1324 2 7 3 1200 1240 28 51 17
0
1000
1000
23 1196 40 4 32 997 207 27 800
800
1
Actuals
Actuals
600 600
35 82 1099 39 56 169 907 123
2
400 400
3
0 1 2 3 0 1 2 3
Predictions Predictions
RF
1200
1290 14 17 15
0
1000
600
24 178 959 94
2
400
5 25 27 1202 200
3
0 1 2 3
Predictions
Figure 7.
Figure 7 Confusion Confusion
matrix analysismatrix analysis
of applied neuralofnetwork
appliedapproaches.
neural network approaches.
Full-size DOI: 10.7717/peerjcs.2008/fig-7
Table 11 The performance comparison with state-of-the-art studies for brain tumor diagnosis.
computations compared to other applied methods. The proposed KNC method takes only
0.0281 s to diagnose brain tumors.
Discussions
In this research, we have introduced a novel neural network method, CVG-Net, for feature
engineering and the timely diagnosis of brain tumors. This proposed method combines
2DCNN and VGG16 to extract spatial features from MRI images. Our experiments
utilized a multi-class MRI image dataset comprising 21,672 images related to glioma
tumors, meningioma tumors, and pituitary tumors. Our commitment to privacy and
confidentiality underscores the implementation of robust data anonymization techniques,
thereby mitigating the risk of any inadvertent disclosure of sensitive personal details. This
explicit ethical discourse is pivotal not only in upholding the integrity of our research but
also in fostering trust within the scientific community and, more importantly, among the
individuals whose health data form the foundation of our study.
Study limitations
In this research study, we have introduced a novel CVG-Net for the prompt diagnosis
of brain tumors. Nevertheless, our proposed research does have some limitations. The
accuracy score of our approach, currently at 96%, could be further improved by minimizing
the loss error rates. To address potential overfitting issues or biases in the machine learning
models, we have employed the k-fold cross-validation mechanism, which validates the
Future work
In the future, we intend to design a graphical user interface app for medical specialists to
facilitate the effective diagnosis of brain tumors using MRI scans. Also we plan to enhance
the performance by implementing transfer learning-based advanced neural networks. The
designed app will utilize the proposed approach in the backend for diagnosis.
Funding
This research is supported by Princess Nourah bint Abdulrahman University Researchers
Supporting Project number (PNURSP2024R136), Princess Nourah bint Abdulrahman
University, Riyadh, Saudi Arabia. Prince Sultan University, Riyadh Saudi Arabia
supported the Article Processing Charges (APC) of this publication. Princess Nourah
bint Abdulrahman University had a role in the study design, conduct, data analysis and
interpretation (determined the scope of the work, directed the analysis, gave feedback and
dictated the desired output/results verification), manuscript writing, and dissemination of
results.
Grant Disclosures
The following grant information was disclosed by the authors:
Princess Nourah bint Abdulrahman University Researchers: PNURSP2024R136.
Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Prince Sultan University, Riyadh Saudi Arabia.
Author Contributions
• Shaha Al-Otaibi conceived and designed the experiments, analyzed the data, authored
or reviewed drafts of the article, and approved the final draft.
• Amjad Rehman conceived and designed the experiments, prepared figures and/or tables,
and approved the final draft.
• Ali Raza performed the experiments, performed the computation work, prepared figures
and/or tables, authored or reviewed drafts of the article, and approved the final draft.
• Jaber Alyami analyzed the data, performed the computation work, prepared figures
and/or tables, and approved the final draft.
• Tanzila Saba analyzed the data, authored or reviewed drafts of the article, and approved
the final draft.
Data Availability
The following information was supplied regarding data availability:
The dataset is available at Kaggle: https://www.kaggle.com/datasets/mohammadhossein77/
brain-tumors-dataset.
The code is available in the Supplementary Files.
Supplemental Information
Supplemental information for this article can be found online at http://dx.doi.org/10.7717/
peerj-cs.2008#supplemental-information.
REFERENCES
Abunadi I, Kumar RL. 2021. Blockchain and business process management in health
care, especially for COVID-19 cases. Security and Communication Networks
2021:2245808 DOI 10.1155/2021/2245808.
Archana B, Karthigha M, Suresh Lavanya D. 2023. Comparative analysis of op-
timisers used in CNN for brain tumor detection. In: 2023 5th international
conference on inventive research in computing applications (ICIRCA). 687–692
DOI 10.1109/ICIRCA57980.2023.10220625.
Azar AT, Khan ZI, Amin SU, Fouad KM. 2023. Hybrid global optimization algo-
rithm for feature selection. Computers, Materials and Continua 74:2021–2037
DOI 10.32604/cmc.2023.032183.
Badjie B, Deniz Ülker E. 2022. A deep transfer learning based architecture for brain
tumor classification using MR images. Information Technology and Control
51(2):332–344 DOI 10.5755/j01.itc.51.2.30835.
Bhatele KR, Bhadauria SS. 2023. Multiclass classification of central nervous system
brain tumor types based on proposed hybrid texture feature extraction methods
and ensemble learning. Multimedia Tools and Applications 82(3):3831–3858
DOI 10.1007/s11042-022-13439-1.