net/publication/340695850
All content following this page was uploaded by Ahmed Hamza Asad on 21 April 2020.
Abstract—Breast cancer accounts for the majority of cancer mortality among women worldwide. Recent research has shown the effectiveness of X-ray mammography in the early detection of breast cancer. Unfortunately, present systems for early detection are expensive and need extremely complex algorithms. The crucial challenge in designing a computer-aided detection (CAD) system for breast cancer is the segmentation phase, which requires highly complex computation. Hence, this paper proposes a CAD system for breast cancer detection in mammographic datasets. The segmentation step is performed by a particle swarm optimization (PSO) algorithm. Statistical, textural, and shape features are calculated over the segmented region. A nonlinear support vector machine (SVM) is then used to analyze the extracted features and classify the mammograms as normal, benign, or malignant. To evaluate performance, the experiment is performed on the Mini-MIAS database. The accuracy rates obtained with 10-fold cross-validation are 85.4% for classifying normal versus abnormal and 89.5% for classifying malignant versus benign. The classification accuracy is 81% when classifying normal, malignant, or benign. Compared with recent research, the results indicate that the proposed algorithm offers a compromise between achieved accuracy and complexity cost.

Keywords—breast cancer, CAD system, mammography, PSO, segmentation, SVM.

I. INTRODUCTION

The biggest threat that faces women in our community is breast cancer. According to the results of the National Cancer Registry Program (NCRP) [1] in Egypt, the breast cancer event rate per 100,000 cases was calculated and reported as 32.0% [2]. For all kinds of cancer, the prime factor in improving the chance of recovery is early detection of the neoplasm. A computerized mammography screening program can assist the early diagnosis of breast cancer [3]. In mammography one is not looking at the level of individual cells but at the image level, which hopefully captures the effect of cell-level changes through X-ray imagery. Screening with mammography is an effective tool for reducing mortality rates, because the images give an indication of what is happening at the cell level. Examination systems differ from country to country, and other aspects such as examination period, age, methods, technologies, and regulation also diverge widely [4]. For example, some countries recommend that two radiologists read the mammograms, one of them being highly experienced, while other countries' examination systems approve single reading in the screening method [5]. The early detection of cancer is a complex task that requires scanning and screening a huge number of images. Hence, it is easy to make mistakes and misclassify cancer cases.

The accuracy of detection and diagnosis from mammography relies on many factors, such as breast density, breast structure, and factors related to the radiologist, such as opinion, skills, and experience. It has been reported that 70% of all missed breast cancers are due to misinterpretation, while the other 30% are due to overlooked lesions [6]. Moreover, a shortage of trained radiologists has drawn growing attention to analyzing mammogram images by computer.

Therefore, computer-aided detection (CAD) systems have become indispensable for enhancing the sensitivity of mammography screening. Sensitivity is mainly increased by marking suspected regions on mammograms, such as microcalcifications and tumors [7, 8]. When a CAD system is applied in cancer detection, only one radiologist is responsible for approving the concluded decision and has to decide whether or not to recall a woman for further physical examination [9]. The CAD-based assessment possibly depends on a single flagged area that the radiologist did not notice at first. In addition, CAD systems have been recommended for training and educating radiologists [10].

This paper studies and proposes a method for computer-aided detection of breast abnormalities that would assist the early discovery of breast cancer in mammograms. The proposed system aims to help radiologists by providing them with a guiding decision on the patient's status. The particle swarm optimization algorithm is brought into the field of mammogram image segmentation to achieve positive identification of breast abnormalities. In other words, the main contribution of the current work is to propose and implement a fully automated CAD system for early breast cancer detection from mammography. The suggested system is intended to be economical, with reasonable complexity, so that it is appropriate as an early inspection method in our low-income countries. The proposed CAD system is founded on PSO and uses a support vector machine (SVM) to classify the region of interest, which is extracted from mammogram images, into three classes (i.e. normal, malignant, and benign). PSO is used to determine the
[Equation (1): matrix; its entries were lost in extraction]
Shape Features
It is very important to differentiate malignant from benign tumors. Several primary shape features are considered key features for this task. In general, shape descriptors (i.e. shape features) are sets of numbers produced to describe a given object shape [22]. Shape features can be divided into two classes, region features and boundary features [23]. The commonly used shape features (descriptors) include area, compactness, circularity, eccentricity, brightness, elongatedness, and convexity. These descriptors are applied to the segmented tumor to calculate shape features in the feature extraction step of the proposed method.

Fig. 1: SVM functions

Figure 1 illustrates the idea of SVM for building hyperplanes. The idea behind SVM is to retain as large a safety margin as possible, where the margin indicates how far the decision boundary is from the data. In the working space the data points may be linearly separable, or separable only in a higher-dimensional space. The SVM kernel function varies according to the described hyperplane (i.e. linear or nonlinear) and is described by a dot product in a space of higher dimension.
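As a rough illustration of the region-based shape descriptors listed under Shape Features, the following sketch computes area, perimeter, and circularity from a binary ROI mask. The paper does not give its exact formulas, so the definitions here are assumptions: area is the foreground pixel count, perimeter is the number of exposed pixel edges, and circularity is taken as 4πA/P². The function names are illustrative only.

```python
import math

def area(mask):
    """Area as the number of foreground (1) pixels in a binary mask."""
    return sum(sum(row) for row in mask)

def perimeter(mask):
    """Perimeter as the count of pixel edges that face background
    or the image border (an assumed, 4-connected definition)."""
    rows, cols = len(mask), len(mask[0])
    edges = 0
    for r in range(rows):
        for c in range(cols):
            if not mask[r][c]:
                continue
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                outside = not (0 <= rr < rows and 0 <= cc < cols)
                if outside or not mask[rr][cc]:
                    edges += 1
    return edges

def circularity(mask):
    """Assumed definition: 4*pi*A / P^2 (larger means more compact)."""
    a, p = area(mask), perimeter(mask)
    return 4 * math.pi * a / (p * p) if p else 0.0

# A 3x3 square region inside a 5x5 image.
square = [[0] * 5 for _ in range(5)]
for r in range(1, 4):
    for c in range(1, 4):
        square[r][c] = 1

print(area(square), perimeter(square), circularity(square))
```

With this edge-counting perimeter, a compact blob scores higher than an elongated one of the same area, which is the kind of contrast such descriptors are meant to capture.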
IV. PROPOSED METHOD
A computer-aided detection (CAD) system is proposed for breast cancer in mammograms. The suggested CAD system consists of five steps, as shown in Figure 2. First, the image preprocessing step removes labels and artifacts. Second, the pectoral muscle is removed from the breast area. Third, in the segmentation step, the region of interest (ROI) is separated from the breast image using the particle swarm optimization algorithm. Fourth, features of interest are extracted: both texture and shape features are computed over the ROI. Fifth, a classification step based on a multi-class SVM divides the detected ROIs into normal and abnormal (benign and malignant). The suggested CAD system is described in detail in the following.
Figure 2: CAD system using the particle swarm optimization algorithm (blocks: Preprocessing, Pectoral Muscle Removal, Segmentation, Classification)
Fig. 4: Different types of breast mass in mammograms
Preprocessing
Mammograms show the intensity of breast density in the image, where fatty tissue appears as dark regions and glandular tissue appears as light regions. Hence, locating a mass is a very hard task in mammogram images. Therefore, it is important to improve the quality of the image, which requires preprocessing the targeted image before analysis and diagnosis.

Preprocessing is a necessary step for mammography images in order to remove noise, artifacts, labels, and background. To fully prepare the mammogram image for the segmentation step, the pectoral muscle has to be removed from the breast region. Scanning mammograms generates artifacts (i.e. scanning artifacts, high- and low-intensity labels, and tape artifacts), as in the case of the Mini-MIAS dataset images. For this purpose we automatically select only the area containing the breast region and remove these artifacts. For the artifact and noise removal step, we use an automatically determined threshold suited to the density category of each mammogram image and then keep just the largest area, which includes the breast with the pectoral muscle region.

Pectoral Removal
The pectoral muscle is located in the upper right or upper left corner of the mammogram image. The pectoral muscle region has high density and appears brighter than breast tissue, which can interfere with the examination of mammograms, as shown in Figure 3. In dense mammograms the pectoral muscle region is not visible and is very difficult to remove; in other cases, the pectoral muscle is visible and easy to extract from the breast tissue [14].

For this purpose we applied a fully automatic pectoral muscle removal method that successfully extracts the pectoral muscle region [25].

Segmentation
The process of finding sets of pixels that are correlated with one another is called "segmentation". The segmentation process aims to decompose the digital mammogram image into regions that have similar characteristics. Each identified region carries the information and structuring elements needed to distinguish the ROIs. The main aim of segmenting the mammogram images in this CAD system is to separate the region of interest from the breast tissue, as shown in Figure 4. Image segmentation is the second step of breast cancer detection in CAD schemes; it separates suspicious regions that may contain masses from breast tissue, i.e., it partitions the breast mammogram image into several regions and extracts ROIs and suspicious mass candidates from the mammogram image. In our proposed system, we chose the metaheuristic particle swarm optimization (PSO) algorithm to perform this task.

Proposed Fitness (cost) function for Segmentation
For the sake of segmenting the mammography images, a fitness function is designed to compromise between two parameters to be optimized. The first parameter is the summation of distances ($D$) between all pairs of pixels in the same cluster, given by equation 8. The second parameter is the number of clusters $K$.

$$D = \sum \begin{bmatrix} d(p_1, p_1) & \cdots & d(p_1, p_n) \\ \vdots & \ddots & \vdots \\ d(p_n, p_1) & \cdots & d(p_n, p_n) \end{bmatrix} \quad (8)$$

$$F = w_1 D + w_2 K \quad (2)$$

where $F$ is the total fitness function and $w_1$, $w_2$ are weight factors, which are fractions less than one whose sum is 1. The weight factors are used to combine the two objectives into a single function and limit the maximum of the function to one. (The symbols $D$, $d$, $p_i$, $F$, $w_1$, and $w_2$ stand in for symbols lost from the source text; the weighted-sum form is reconstructed from the description.)
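The fitness function described for segmentation (a weighted combination of within-cluster distance and cluster count, with weights summing to 1) might be sketched as follows. This is only a sketch under stated assumptions: the distance is taken as the absolute intensity difference, both terms are normalized to [0, 1] so that the result cannot exceed one, and the names `fitness`, `k_max`, `w_dist`, and `w_k` are hypothetical. The paper's actual distance measure and normalization are not recoverable from the text.

```python
def intra_cluster_distance(values, labels, k):
    """Sum of pairwise distances between pixels that share a cluster.
    ASSUMPTION: distance = absolute intensity difference. Each unordered
    pair is counted once; counting both orders only doubles the sum."""
    total = 0.0
    for cluster in range(k):
        members = [v for v, lab in zip(values, labels) if lab == cluster]
        for i in range(len(members)):
            for j in range(i + 1, len(members)):
                total += abs(members[i] - members[j])
    return total

def fitness(values, labels, k, k_max, w_dist=0.5, w_k=0.5):
    """Weighted sum of normalized distance and normalized cluster count.
    Lower is better. ASSUMPTION: both terms are scaled to [0, 1]."""
    assert abs(w_dist + w_k - 1.0) < 1e-9, "weights must sum to 1"
    n = len(values)
    span = max(values) - min(values)
    # Upper bound: all pixels in one cluster, every pair at maximum distance.
    d_max = span * n * (n - 1) / 2
    d_norm = intra_cluster_distance(values, labels, k) / d_max if d_max else 0.0
    k_norm = k / k_max
    return w_dist * d_norm + w_k * k_norm

# Four pixel intensities forming two obvious groups.
pixels = [0, 0, 10, 10]
two_clusters = fitness(pixels, [0, 0, 1, 1], k=2, k_max=4)
one_cluster = fitness(pixels, [0, 0, 0, 0], k=1, k_max=4)
print(two_clusters, one_cluster)
```

In a PSO setting, each particle would encode a candidate clustering and this value would be the cost it tries to minimize; the cluster-count term keeps the optimizer from driving the distance to zero by giving every pixel its own cluster.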
V. EXPERIMENT
Mini-MIAS Database
To evaluate the proposed system, we used digital mammography images obtained from the mini-MIAS dataset. This dataset consists of right and left breast images of all breast density categories: glandular, fatty, and dense breasts [27]. The Mini-MIAS dataset consists of 322 records of left and right breast images for 161 patients. The mammogram images are categorized into three categories (malignant, benign, and normal), divided into 208 normal cases, 63 benign cases, and 51 malignant cases. The abnormalities fall into 5 categories: architectural distortion, ill-defined, circumscribed, asymmetry, and spiculated masses. The image size is 1024 × 1024 pixels. The images are in grayscale, with pixel intensities ranging between 0 and 255. A nonlinear multi-class SVM classifier is used to classify segmented ROIs in mammogram images into the three classes. The chosen kernel for the SVM is the radial basis function (RBF) kernel, which is popular among researchers for its high accuracy relative to other kernels. Of the total of 322 mammogram images from the mini-MIAS dataset, 207 are normal cases, 64 are benign cases, and the remaining 51 are malignant cases.

VI. RESULTS AND DISCUSSION
[…] low-intensity artifacts (label and other noise). Figure 5(b) shows the breast region after selecting the largest object, removing the label and noise, and orienting the breast image. Figure 5(c) shows the result of extracting the pectoral muscle area from the mammogram.

In Figure 7, column (a) shows the original mammogram, and the second column shows the output of the segmentation step, i.e. the ROI segmented using PSO. The first row shows a normal case, the second row a benign case, and the last row a malignant case.

The feature extraction step computes 20 different features used in classification. Figure 7 presents the value of the "volume" feature for the different types of masses. It is clear that there is a definite boundary between the three classes, which can be determined by the chosen classifier. This boundary is also clear in Figure 8, which shows a map of 3 different features for the three classes. These features are standard deviation, energy, and homogeneity.
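To make the three features named for Figure 8 concrete, the sketch below computes standard deviation, energy, and homogeneity for a small grayscale patch. It assumes that energy and homogeneity are the usual gray-level co-occurrence matrix (GLCM) statistics, which the surviving text does not confirm. Energy is taken as the sum of squared GLCM entries and homogeneity as the sum of p(i, j) / (1 + |i - j|); other conventions exist. All function names are illustrative.

```python
import math

def glcm(image, levels, dx=1, dy=0):
    """Normalized co-occurrence matrix for one pixel offset (dx, dy)."""
    counts = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    total = 0
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            if 0 <= r2 < rows and 0 <= c2 < cols:
                counts[image[r][c]][image[r2][c2]] += 1
                total += 1
    return [[n / total for n in row] for row in counts]

def energy(p):
    """Assumed definition: sum of squared GLCM probabilities."""
    return sum(v * v for row in p for v in row)

def homogeneity(p):
    """Assumed definition: sum of p(i, j) / (1 + |i - j|)."""
    return sum(v / (1 + abs(i - j))
               for i, row in enumerate(p) for j, v in enumerate(row))

def std_dev(image):
    """Population standard deviation of the pixel intensities."""
    pixels = [v for row in image for v in row]
    mean = sum(pixels) / len(pixels)
    return math.sqrt(sum((v - mean) ** 2 for v in pixels) / len(pixels))

# Two 2x2 patches with 2 gray levels: flat versus checkerboard.
flat = [[1, 1], [1, 1]]
checker = [[0, 1], [1, 0]]
for patch in (flat, checker):
    p = glcm(patch, levels=2)
    print(std_dev(patch), energy(p), homogeneity(p))
```

A flat patch gives maximal energy and homogeneity with zero standard deviation, while a high-contrast texture lowers both GLCM statistics and raises the standard deviation, which is why such features can separate tissue classes.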