Classification of MC Clusters in Digital Mammography Via Haralick Descriptors and Heuristic Embedded Feature Selection Method
Classification of MC Clusters in Digital Mammography Via Haralick Descriptors and Heuristic Embedded Feature Selection Method
Classification of MC Clusters in Digital Mammography Via Haralick Descriptors and Heuristic Embedded Feature Selection Method
ABSTRACT
Characterizing the texture of mammographic tissue is an efficient and robust tool for the diagnosis of
microcalcification (MC) clusters in mammography because it does not require a prior MC segmentation stage. This
work is not only intended to validate MCs surrounding tissue hypothesis that reveals the potential of breast tissue
surrounding MCs to diagnose microcalcifications, but to present an improvement over the existing methods by
introducing a new heuristic feature selection based on particle swarm optimization and KNN classifier (PSO-KNN).
Using MC clusters from mini-MIAS and a local dataset, our results demonstrate the effectiveness of the proposed
characterization and feature selection methods.
Keywords: Microcalcification cluster, surrounding tissue, Haralick measures, embedded feature selection.
1 INTRODUCTION
Morphology based methods are the primary tools for
diagnosis and decision making on the nature of
mammographic microcalcifications [1]. However, a key
and challenging step for analyzing clustered
microcalcifications using their shape is the segmentation
stage [2]. An alternative and promising method for
characterizing MCs is by analyzing the texture of
mammographic regions enclosing them [3]. Texturebased computer-aided diagnosis of MCs overrides the
need for the MC segmentation stage. A texture-based
diagnosis approach is also more suitable for
characterizing the texture dependency and spectral
properties that are invisible to human eyes and cannot be
described using shape measures. Such alternative method
has been investigated in several studies [3]-[9]. A
common shortcoming of [4]-[6], [8]-[9] is the bias of
texture analysis due to the presence of breast
calcifications that are tiny deposits of calcium, which
cannot be considered malignant or benign lesions. A few
studies [3],[7] from the literature have attempted to
minimize the bias of the texture-based diagnosis by
excluding image locations that correspond to the
microcalcifications before characterizing the malignancy
of a given mammographic region. In answering the
question: can the texture of breast tissue surrounding
Volume 6 Number 4
Page 8
www.ubicc.org
(1)
Where w is constant, typically in interval [0.0 1.0], and
represents the inertia of the movement, r1 and r2 are
(2)
No.
Measure description
1
2
3
4
5
6
Autocorrelation
11
Contrast
12
Sum average
Correlation
13
Sum variance
Cluster Prominence
14
Sum entropy
Cluster Shade
15
Difference variance
Dissimilarity
16
Energy
17
Entropy
18
Homogeneity
19
Maximum probability
20
Difference entropy
Information measure
of correlation 1
Information measure
of correlation 2
Inverse difference
normalized (INN)
Inverse difference
moment normalized
10
Volume 6 Number 4
Page 9
www.ubicc.org
Digital
Mammogram
PSO-KNN
Feature Selection
KNN Classifier
Region Selection*
Haralick Feature
Extraction
MC
Segmentation
Srrounding Tissue
Segmentation
Figure 1: Characterization of MC clusters using surrounding tissue and PSO-KNN embedded feature selection.
`
Volume 6 Number 4
Page 10
www.ubicc.org
(a)
(b)
(c)
Figure 2: Segmentation of surrounding tissue. (a) An original mammographic region of size of 256 256 pixels
with a malignant MC cluster, (b) segmented microcalcifications and (c) surrounding tissue obtained by subtracting
image regions corresponding to the segmented MCs from the original region shown in (a), that is subtracting the
corresponding gray-level image representation of fig.2b from the image in fig.2a.
(3)
EXPERIMENTAL RESULTS
Volume 6 Number 4
Page 11
www.ubicc.org
TPF
FPF
Accuracy
Fitness
miniMIAS
3
3
3
3
3
3
3
3
5
7
2
6
7
4
6
5
1
1
0.95
1
0.94
0.88
0.94
0.88
1
1
0.92
0.92
0.94
1.0
0.94
1.0
1
1
0.94
0.96
0.94
0.94
0.94
0.94
0.064
0.095
0.138
0.151
0.196
0.161
0.180
0.175
Bronson
Table 3: Details of PSO-KNN learning models that achieve the highest classification performance.
Dataset
TPF
FNF
Accuracy
1.0
1.0
1.0
Haralick Features
Avg. of Inverse difference normalized
Avg. of Difference entropy
SD of Cluster Shade
miniMIAS
SD of Cluster Prominence
Range of inverse difference normalized
Bronson
0.88
1.0
0.94
Volume 6 Number 4
Page 12
www.ubicc.org
CONCLUSIONS
REFERENCES
[1] M. Kallergi: Computer-aided diagnosis of
mammographic microcalcification clusters. Medical
Physics, Vol. 31 No. 2, pp. 314-326(2004).
2009. CADx of
mammographic mass and clustered microcalcifications: A review, Medical Physics, Vol. 36 No. 6 ,
pp. 2052-2068 (2009).
[4] H. S.
Volume 6 Number 4
Page 13
www.ubicc.org
[15] R.
Volume 6 Number 4
Page 14
www.ubicc.org