FastLeakyResNet-CIR A Novel Deep Learning Framework For Breast Cancer Detection and Classification
FastLeakyResNet-CIR A Novel Deep Learning Framework For Breast Cancer Detection and Classification
Corresponding authors: Pingping Bing ([email protected]), Shuangni Duan ([email protected]), and Lemei Zhu
([email protected])
This work was supported in part by the National Key Research and Development Program of China under Grant 2018YFB2000800 and
Grant 2022YFB3303600; in part by Hunan Provincial Education Department under Grant 20A056, Grant 22C0669, and Grant 22C0565;
and in part by Hunan Provincial Health Commission Foundation under Grant D202302088596.
ABSTRACT Breast cancer is a type of disease that primarily affects the breast tissue, and it is crucial
to achieve early diagnosis for successful treatment and recovery. In recent years, the residual network
(ResNet) has gained significant attention in the detection of breast cancer using medical images. In this paper,
we propose an efficient and robust deep learning framework called FastLeakyResNet-CIR, an improved
ResNet architecture, for breast cancer detection and classification. The FastLeakyResNet-CIR achieves an
impressive accuracy of 98.94% when evaluated on a dataset of 7909 microscopic images of breast tumor
tissue from BreakHis dataset, which outperforms the state-of-the-art methods, e.g. ResNet18, ResNet50,
InceptionV3 and VGG16. The experiment results further highlight the potential of FastLeakyResNet-CIR
for accurate and rapid diagnosis of breast cancer, thus facilitating effective medical treatment for patients.
INDEX TERMS Breast cancer, convolutional neural network, FastLeakyResNet-CIR, medical image
classification, residual network.
viable solution for early breast cancer detection because outperforms several architectures, e.g. ResNet 50, SeResNet
it is easy to carry, and low cost than mammography [10]. 50 and Inception. Mohiyuddin et al. [24] put forward a
It plays a crucial role in differentiating between solid-cystic modified network of YOLOv5 to detect and classify breast
breast lesions and characterizing solid lesions. Thanks tumors, and it performs better than YOLOv3 and faster
to technological advances, high-frequency ultrasonography regions with CNN (RCNN) with lowering the false positive
probes have enabled the visualization of even small breast ratio (FPR) and false negative ratio (FNR) and boosting
lesions, especially for patients with dense breast [11]. the Matthews correlation coefficient (MCC) value. Asadi
Therefore, it is the ideal imaging tool to guide subsequent and Memon [25] employed a cascade network with U-Net
procedures. Magnetic resonance imaging is primarily utilized for segmentation and a ResNet for efficient breast cancer
as a supplemental means to breast cancer screening along detection, which has a classification accuracy of 98.61% and
with mammography or ultrasound [12]. Since 2000, the F1 score of 98.41%.
breast MRI, with excellent sensitivity and specificity, has To improve the accuracy and interpretability of breast
become an important tool in high risk screening and diagnosis cancer detection and classification, we propose a novel deep
for earlier-stage breast cancer by measuring the size of the learning framework called fast leaky residual network with
cancer, and looking for other tumors in the breast. Recent class imbalance reduction, termed as FastLeakyResNet-CIR,
research has found that the MRI is capable of locating which has higher accuracy and faster convergence rate. The
some small breast lesions that may be sometimes missed by main contribution of this study is as follows.
mammography [13], thus, the MRI has been extensively used (1) The novel scheme FastLeakyResNet-CIR is presented
for patients with newly diagnosed breast cancer. for predicting breast cancer.
Nowadays, the methods based on deep learning (DL) (2) We strengthen the robustness of the model by
have been extensively applied to medical image analysis introducing the improved loss function and improved Adam
and breast cancer diagnosis [14], [15], [16]. Convolutional algorithm.
neural network (CNN) is one of the most popular approaches (3) The proposed method performs well in a performance
in DL [17], [18], which achieves an end-to-end mapping comparison against the state-of-the-art breast cancer predic-
learning from input image to output result and becomes an tion methods.
efficient classification model [19]. Subsequently, a variety of The rest of the paper is organized as follows. Section II
DL models are emerging, such as RNN, AlexNet, GoogleNet, provides the theoretical basis. Section III introduces a
ResNet, VGG, and U-Net. Tan et al. [20] utilized CNN detailed description of the proposed breast cancer detection
acting on mammogram imaging system to produce three scheme. Section IV presents the experiments and results.
classifications of normal, benign and malignant, which assists Section 5 gives the conclusions.
specialist to diagnosis and classify the breast cancer quickly.
Mambous et al. [21] developed a deep neural network (DNN) II. METHODOLOGY
model that is related to pre-trained Inception V3 model Figure 1 shows the details of the proposed breast cancer
for sick and healthy breast classification, where they pay detection scheme that consists of three main steps. First,
attention to the physical model camera sensitivity of breast. the data preprocessing aims to reduce training time, enhance
Ting et al. [22] presented a convolutional neural network image, alleviate overfitting and learn feature, which includes
improvement for breast cancer classification (CNNI-BCC), several techniques such as resizing and balance, Gamma
which can classify breast cancer medical images into healthy, transform, data augmentation and shuffling and splitting.
benign and malignant patients in the context of no prior Second, the FastLeakyResNet-CIR is designed to predict
information about cancerous lesion. It improves the breast breast cancer. Third, the improved loss function and improved
cancer lesion classification and helps experts for breast cancer Adam algorithm are introduced to strengthen the robustness
identification. Barbosa et al. [23] introduced the deep-wavelet of the model.
neural network (DWNN) for feature extraction in image
representation, combining it with intelligent algorithms for A. DATA AND PROCESSING
pattern recognition. They demonstrated a sensitivity of 95% 1) RESIZING AND BALANCE
and specificity of 79% using the DWNN for detection of In this study, we utilize the BreaKHis dataset, Breast Cancer
breast lesions in thermographic images. While promising, Histopathological Database, to validate the proposed scheme.
there are some limitations about the DWNN approach. The dataset was previously used by Spanhol et al. [26], which
It relies on handcrafted feature extraction, which may not contains 7909 breast cancer histopathology images acquired
capture discriminative features as effectively as end-to-end on 82 patients with two states benign and malignant. Since
feature learning methods like CNNs. Also, their evaluation the number of normal training sets is much smaller than the
was on a small proprietary dataset, so further validation on number of abnormal training sets in the dataset, we replicate
larger benchmark datasets would be needed. Naseem et al. the normal training sets itself to be the same as the number of
[9] proposed a computer-aided detection approach based on the two training sets. To cut down the data size, all images are
CNN using thermal images, it is faster, reliable and robust, resized to 224*224 before training, which will significantly
and obtains a higher accuracy (92%) and F1-score (92%) that reduce the training time of the model.
70826 VOLUME 12, 2024
R. Zeng et al.: FastLeakyResNet-CIR: A Novel Deep Learning Framework
FIGURE 2. (a) The image before Gamma transform and (b) the image
after Gamma transform.
where N represents the number of samples, L is loss weights ωmalignant and ωbenign in the loss function to account
function, y and ŷ denotes the true value and predicted value, for class imbalance between malignant and benign samples.
respectively. The motivation is to give more influence to minority
However, the traditional loss function suffers from some malignant samples during training. By assigning a higher
drawbacks, for example, 1) it does not cope well with weight ωmalignant to losses on malignant samples, we increase
imbalanced data, resulting in a bias towards the majority their contribution to update the model parameters. This
samples and far from the minority samples; 2) it does not counters the tendency of overwhelmed malignant examples.
make full use of confidence information, lacking targeted The exact values can be tuned as a hyperparameter, with
penalty for uncertain samples; 3) it does not impose common choices being the inverse class frequencies or a fixed
the constraints on the complexity of the model, leading rebalancing ratio. The λ weight for the confidence penalty
to potential overfitting. The above-mentioned issues may is set to 0.1 based on preliminary experiments. As a result,
degrade the performance of the model when dealing with we obtain a detection model for breast cancer that focuses
imbalanced data and uncertain samples, and the ability of more on important class, possesses robustness, and performs
generalizing to new data. well on unseen data.
In the paper, we introduce the category weights, confidence
penalty and L1 regularization into the above loss function,
thus, Eq. (1) can be rewritten as:
2) OPTIMIZER
L(y, ŷ, c) Adam (Adaptive Moment Estimation), proposed by Kingma
N and Ba [27], is an optimizer for optimizing stochastic gradient
1 X
=− (ωmalignant • yi log(ŷi ) descent algorithm for neural network, which can be seen as a
N combination of Momentum and RMSProp.
i=1
+ ωbenign (1 − yi ) log(1 − ŷi )) + λH (c, t) + α • ∥w∥1 The traditional Adam algorithm ignores the stability of
(2) gradient and only considers the first-order and the second-
order moment estimation. It may cause suboptimal learning
where the weights of malignant samples and benign samples rate when dealing with gradient that exhibits high variance or
are ωmalignant and ωbenign , respectively, c is the predicted instability. Secondly, the Adam lacks personalized learning
confidence, t is the threshold, H (•) is the penalty term and rate adjustment, because it utilizes fixed decay rate or step
the corresponding weight is λ, ∥•∥1 denotes L1 regularization to adjust the learning rate without taking into account the
term and the corresponding weight is α, w is the weight vector difference in parameter importance or sensitivity. Thirdly,
parameter. the Adam relies only on global statistical information and
By employing category weights, we can improve the fails to adequately consider the feature of each individual
problem of class imbalance, and enhance the impact of breast batch.
cancer samples in the optimization process. Considering In the paper, we bring some improvements to the traditional
confidence information, we can strengthen the robustness Adam algorithm as follows.
and stability of the proposed model. Introducing a L1 regu- 1) Computing the gradient gt for the current step.
larization term, we can promote the model’s generalization 2) Calculating the stability term st , which is the standard
ability and prevent the overfitting. We introduce category deviation of the gradient plus a small constant.
IV. CONCLUSION
In this study, we present a novel FastLeakyResNet-CIR
model for breast cancer detection and classification. The
proposed framework is composed of convolutional blocks,
adaptive average pooling layer, linear layer and output layer.
It reaches the highest classification accuracy of 98.94%. Also,
the proposed breast cancer prediction scheme achieves an
impressive score in a performance comparison against several
classic breast cancer detection approaches. In addition, the
FastLeakyResNet-CIR effectively solves the problem of class
imbalance, avoids the overfitting and has good generalization
ability and faster convergence speed by introducing the
improved loss function and improved Adam algorithm, thus,
it is a very promising and application-worthy deep learning
model in the field of medical image analysis. In future
work, data enhancement methods and breast cancer severity
diagnosis will be explored.
FIGURE 5. Accuracy (a) and loss (b) comparison results obtained by five
methods.
ACKNOWLEDGMENT
The authors would like to thank an associate editor Prof.
B. RESULT AND COMPARISON Shashikant Patil, Humaira Nisar, and anonymous reviewers,
In this experiment, the iterative process of FastLeakyResNet- for their constructive comments that improved the article
CIR is relative stable, and it converges at the 15th Epoch. greatly.
The whole training process ends early at the 31st Epoch.
The FastLeakyResNet-CIR achieves better classification REFERENCES
accuracy, recall, precision, and F1-score values 98.94%, [1] T. Mahmood, J. Li, Y. Pei, F. Akhtar, A. Imran, and K. U. Rehman, ‘‘A
98.36%, 99.43% and 98.89%, respectively, which illustrates brief survey on breast cancer diagnostic with deep learning schemes using
multi-image modalities,’’ IEEE Access, vol. 8, pp. 165779–165809, 2020.
the effectiveness of the proposed method for detecting breast [2] Y. Gan, L. Li, L. Zhang, S. Yan, C. Gao, S. Hu, Y. Qiao, S. Tang,
cancer. Figure 4 shows the confusion matrix for the test C. Wang, and Z. Lu, ‘‘Association between shift work and risk of prostate
phase of the proposed breast cancer detection framework. cancer: A systematic review and meta-analysis of observational studies,’’
Carcinogenesis, vol. 39, no. 2, pp. 87–97, Feb. 2018.
It can be clearly observed that the 3280 images are tested, [3] X. Wang, G. Chen, Y. Zhang, W. M. Ghareeb, Q. Yu, H. Zhu, X. Lu,
1590 samples are classified as normal, and 1690 samples Y. Huang, S. Huang, D. Hou, and P. Chi, ‘‘The impact of circumferential
are classified as abnormal. Among the normal samples, tumour location on the clinical outcome of rectal cancer patients managed
with neoadjuvant chemoradiotherapy followed by total mesorectal exci-
1564 samples are correctly predicted as normal (TP), while sion,’’ Eur. J. Surgical Oncol., vol. 46, no. 6, pp. 1118–1123, Jun. 2020.
26 samples are incorrectly predicted as abnormal (FN). [4] X. Xie, X. Wang, Y. Liang, J. Yang, Y. Wu, L. Li, X. Sun, P. Bing,
Furthermore, 9 abnormal samples are wrongly predicted B. He, G. Tian, and X. Shi, ‘‘Evaluating cancer-related biomarkers based
on pathological images: A systematic review,’’ Frontiers Oncol., vol. 11,
as normal (FP), and 1681 abnormal samples are correctly Nov. 2021, Art. no. 763527.
predicted as abnormal (TN). [5] P. Peng, Y. Luan, P. Sun, L. Wang, X. Zeng, Y. Wang, X. Cai, P. Ren, Y. Yu,
In addition, for further clarifying the superiority of Q. Liu, H. Ma, H. Chang, B. Song, X. Fan, and Y. Chen, ‘‘Prognostic
factors in stage IV colorectal cancer patients with resection of liver
the FastLeakyResNet-CIR, a comparison is made between and/or pulmonary metastases: A population-based cohort study,’’ Frontiers
the proposed method with some classical neural networks Oncol., vol. 12, Mar. 2022, Art. no. 850937.
[6] N. Wang, J. Chen, W. Chen, Z. Shi, H. Yang, P. Liu, X. Wei, X. Dong, [25] B. Asadi and Q. Memon, ‘‘Efficient breast cancer detection via cascade
C. Wang, L. Mao, and X. Li, ‘‘The effectiveness of case management for deep learning network,’’ Int. J. Intell. Netw., vol. 4, pp. 46–52, 2023.
cancer patients: An umbrella review,’’ BMC Health Services Res., vol. 22, [26] F. A. Spanhol, L. S. Oliveira, C. Petitjean, and L. Heutte, ‘‘A dataset
no. 1, p. 1247, Oct. 2022. for breast cancer histopathological image classification,’’ IEEE Trans.
[7] Y. Yari, T. V. Nguyen, and H. T. Nguyen, ‘‘Deep learning applied Biomed. Eng., vol. 63, no. 7, pp. 1455–1462, Jul. 2016.
for histological diagnosis of breast cancer,’’ IEEE Access, vol. 8, [27] D. P. Kingma and L. J. Ba, ‘‘Adam: A method for stochastic optimization,’’
pp. 162432–162448, 2020. in Proc. Int. Conf. Learn. Represent. (ICLR), 2015.
[8] A. U. Haq, J. P. Li, A. Saboor, J. Khan, S. Wali, S. Ahmad, A. Ali,
G. A. Khan, and W. Zhou, ‘‘Detection of breast cancer through clinical data
using supervised and unsupervised feature selection techniques,’’ IEEE
Access, vol. 9, pp. 22090–22105, 2021.
[9] U. Naseem, J. Rashid, L. Ali, J. Kim, Q. E. U. Haq, M. J. Awan,
and M. Imran, ‘‘An automatic detection of breast cancer diagnosis and
prognosis based on machine learning using ensemble of classifiers,’’ IEEE RUIMING ZENG received the master’s degree
Access, vol. 10, pp. 78242–78252, 2022. in arts from Changsha University of Science and
[10] R. Sood, A. F. Rositch, D. Shakoor, E. Ambinder, K.-L. Pool, E. Pollack, Technology, Changsha, China, in 2011. He is
D. J. Mollura, L. A. Mullen, and S. C. Harvey, ‘‘Ultrasound for breast currently a Lecturer with Changsha Medical Uni-
cancer detection globally: A systematic review and meta-analysis,’’ J. versity. His research interests include linguistics,
Global Oncol., no. 5, pp. 1–17, Dec. 2019. philosophy, and medical science.
[11] C.-X. Chen, J. Zhu, and Z. Zeng, ‘‘Use of ultrasound to observe mycosis
fungoides: A case report and review of literature,’’ Current Med. Imag.
Formerly Current Med. Imag. Rev., vol. 18, no. 7, pp. 771–775, Jul. 2022.
[12] R. Huo, Y. Liu, H. Xu, J. Li, R. Xin, Z. Xing, S. Deng, T. Wang,
H. Yuan, and X. Zhao, ‘‘Associations between carotid atherosclerotic
plaque characteristics determined by magnetic resonance imaging and
improvement of cognition in patients undergoing carotid endarterectomy,’’
Quant. Imag. Med. Surg., vol. 12, no. 5, pp. 2891–2903, May 2022.
[13] S. Radhakrishna, S. Agarwal, P. M. Parikh, K. Kaur, S. Panwar, S. Sharma,
A. Dey, K. K. Saxena, M. Chandra, and S. Sud, ‘‘Role of magnetic
resonance imaging in breast cancer management,’’ South Asian J. Cancer, BOAN QU is currently pursuing the bachelor’s
vol. 7, no. 2, pp. 69–71, 2018. degree in safety engineering with Beijing Uni-
[14] X. Yi, X. Guan, C. Chen, Y. Zhang, Z. Zhang, M. Li, P. Liu, A. Yu, X. Long, versity of Chemical Technology. His research
L. Liu, B. T. Chen, and C. Zee, ‘‘Adrenal incidentaloma: Machine learning- interests include fault diagnosis and monitoring
based quantitative texture analysis of unenhanced CT can effectively of chemical equipment and solving engineering
differentiate sPHEO from lipid-poor adrenal adenoma,’’ J. Cancer, vol. 9, problems based on machine learning.
no. 19, pp. 3577–3582, 2018.
[15] B. He, J. Lang, B. Wang, X. Liu, Q. Lu, J. He, W. Gao, P. Bing,
G. Tian, and J. Yang, ‘‘TOOme: A novel computational framework to infer
cancer tissue-of-origin by integrating both gene mutation and expression,’’
Frontiers Bioengineering Biotechnol., vol. 8, p. 394, May 2020.
[16] B. He, Y. Zhang, Z. Zhou, B. Wang, Y. Liang, J. Lang, H. Lin, P. Bing,
L. Yu, D. Sun, H. Luo, J. Yang, and G. Tian, ‘‘A neural network framework
for predicting the tissue-of-origin of 15 common cancer types based
on RNA-seq data,’’ Frontiers Bioengineering Biotechnol., vol. 8, p. 737,
Aug. 2020. WEI LIU received the B.E. degree in exploration
[17] B. He, Q. Lu, J. Lang, H. Yu, C. Peng, P. Bing, S. Li, Q. Zhou, Y. Liang, and geophysics from China University of Petroleum
G. Tian, ‘‘A new method for CTC images recognition based on machine (East China), Qingdao, China, in 2006, and the
learning,’’ Frontiers Bioeng. Biotechnol., vol. 8, p. 897, Aug. 2020. M.S. degree in exploration geophysics and the
[18] P. Bing, Y. Liu, W. Liu, J. Zhou, and L. Zhu, ‘‘Electrocardiogram Ph.D. degree in geological resources and geo-
classification using TSST-based spectrogram and ConViT,’’ Frontiers
logical engineering from China University of
Cardiovascular Med., vol. 9, Oct. 2022, Art. no. 983543.
Petroleum (Beijing), Beijing, China, in 2009 and
[19] A. Raza, N. Ullah, J. A. Khan, M. Assam, A. Guzzo, and H. Aljuaid,
2016, respectively. He is currently a Lecturer
‘‘DeepBreastCancerNet: A novel deep learning model for breast cancer
detection using ultrasound images,’’ Appl. Sci., vol. 13, no. 4, p. 2082, with Beijing University of Chemical Technology.
Feb. 2023. His research interests include signal analysis and
[20] Y. J. Tan, K. S. Sim, and F. F. Ting, ‘‘Breast cancer detection using processing, seismic data processing and interpretation, and mechanical fault
convolutional neural networks for mammogram imaging system,’’ in Proc. diagnosis.
Int. Conf. Robot., Autom. Sci. (ICORAS), Nov. 2017, pp. 1–5.
[21] S. Mambou, P. Maresova, O. Krejcar, A. Selamat, and K. Kuca, ‘‘Breast
cancer detection using infrared thermal imaging and a deep learning
model,’’ Sensors, vol. 18, no. 9, p. 2799, Aug. 2018.
[22] F. F. Ting, Y. J. Tan, and K. S. Sim, ‘‘Convolutional neural network
improvement for breast cancer classification,’’ Expert Syst. Appl., vol. 120,
pp. 103–115, Apr. 2019. JIANGHAO LI is currently pursuing the bache-
[23] V. A. de Freitas Barbosa, M. A. de Santana, M. S. Andrade, lor’s degree in medical imaging with Changsha
R. C. F. de Lima, and W. P. dos Santos, ‘‘Deep-wavelet neural networks Medical University. His research interests include
for breast cancer early diagnosis using mammary termographies,’’ in Deep data extraction, data verification, and statistical
Learning for Data Analytics: Foundations, Biomedical Applications and analysis.
Challenges. Amsterdam, The Netherlands: Elsevier, 2020, pp. 99–124.
[24] A. Mohiyuddin, A. Basharat, U. Ghani, V. Peter, S. Abbas, O. B. Naeem,
and M. Rizwan, ‘‘Breast tumor detection and classification in mammogram
images using modified YOLOv5 network,’’ Comput. Math. Methods Med.,
vol. 2022, pp. 1–16, Jan. 2022.
HONGSHEN LI is currently pursuing the bach- SHUANGNI DUAN received the master’s degree
elor’s degree in computer science with Changsha from the University of South China, Hunan, China,
Medical University. His research interests include in 2011. She is currently a Lecturer with the
deep learning and data analysis. School of Public Health, Xiangnan University. Her
research interests include public health and health
policy.
PINGPING BING received the Ph.D. degree in LEMEI ZHU received the master’s degree from the
geophysics from China University of Petroleum University of South China, China, in 2011. She is
(Beijing), Beijing, China, in 2012. She is currently currently with the School of Public Health, Chang-
an Associate Professor with Changsha Medical sha Medical University. Her research interests
University. Her research interests include opti- include molecular pharmaceutics, novel nanofab-
mization algorithms, signal analysis and model rication, and processing technologies.
prediction, and deep learning.