Fuzzy C-Means Clustering Using Principal Component Analysis For Image Segmentation
Fuzzy C-Means Clustering Using Principal Component Analysis For Image Segmentation
ISSN No:-2456-2165
Abstract:- Nowadays, Image segmentation is the area in The FCM uses the membership function and an
which most of the research is carried out. It is considered objective function to perform cluster classification. This
as one of the most crucial fields in image analysis. It is classification is based on some features, and an image has
used to divide an image into meaningful regions and thus lots of features. So, which feature should be considered for
extract the region of interests. These regions are classification is an issue? This issue is resolved by replacing
considered as objects. Fuzzy c-means (FCM) clustering is all the features with a standard set of features. That will
one of the best clustering method used for image reproduce the inter-correlations of the features [6]. Then
segmentation, but have a drawback of unknown cluster according to the features selected the number of clusters is
number. This paper focuses on this drawback of FCM calculated. This paper uses PCA to identify the number of
and to overcome it, the Principal component analysis clusters for FCM.
(PCA) is used. PCA is used for detection of cluster
numbers for FCM because of its dimension reduction The remaining paper organised as follows: Section 2
capability. The cluster number is the important factor on explains FCM and PCA, section 3 includes the proposed
which the clustering result depends. Experimental results algorithm, section 4 gives the experimental results, and
show that the proposed method efficiently calculate the section 5 concludes the study.
cluster number for different test images and gives
effective results. II. FUZZY C-MEANS
Keywords:- Clustering, Cluster Number, Fuzzy C-Means, Clustering involves partition of image pixels into
Image Compression, Image Segmentation, Principal different clusters such as the pixels in the same clusters are
Component Analysis. similar features, whereas pixels of different clusters are as
dissimilar as possible. The clusters are identified through
I. INTRODUCTION similarity measures. FCM clustering allows a single pixel to
belong to two or more clusters. It is achieved by assigning
Image segmentation segments the image into membership to each pixel corresponding to each cluster
meaningful objects or region of interests. It makes the feature based on the distance between the cluster center and the
extraction easy and lowers the computational complexity of pixel. Minimum the distance between pixel and the center
image by compressing it while maintaining the quality of the more the membership of pixel towards that specific cluster
image. Thus, it simplifies the image into something more center. The sum of membership of each pixel should be one
useful and meaningful [1]. [7]. The membership of each pixel is updated in each
iteration until there is no further updation. The membership
Clustering term was first given by Tryon in 1939 [2]. It is of each pixel is calculated by the membership function,
a process of classifying an image into regions based on which is as follows:
homogeneity or heterogeneity of some feature criterion. The
2
classified regions of clustering process are called clusters. ( −1)
𝜇𝑖𝑗 = 1⁄∑𝑐𝑘=1(𝑑𝑖𝑗⁄𝑑𝑖𝑘) 𝑚 (1)
These clusters are regions which have minimum intra-cluster
distance and maximum inter-cluster distance [3]. Clustering Where uij is membership of ith pixel to the jth cluster
algorithms are categorized into hard and soft clustering. Hard
center, dij represents Euclidean distance between the ith pixel
clustering refers to K-means clustering, where each pixel
and jth cluster center, m is the fuzziness index, and c
belongs specifically to a single cluster. It has a drawback that represents cluster number. Along with the membership
it cannot converge optimally in the general case. Soft function, the cluster center is also updated in each iteration.
clustering refers to Fuzzy c-means clustering, which was
proposed by Dunn in 1972 [4]. It is the extended version of (∑𝑛 𝑚
𝑖=1(𝑢𝑖𝑗) 𝑥𝑖)
k-means in which there is a membership function which 𝑣𝑗 = , ∀𝑗 = 1,2, … 𝑐 (2)
(∑𝑛 𝑚
𝑖=1(𝑢𝑖𝑗) )
states the membership of each pixel to the different clusters.
That means in it a pixel can belong to two or more clusters. Where vj represents the jth cluster center, n is the
Although the computation cost of FCM is higher than the k- number of pixels and xi is the ith pixel.
means, however, it gives optimal results [5].
Algorithm:
Table 1 Shows the (a) Original Image, (b) Segmented Image and (c) Cluster Number Calculated from the Proposed Method
Test images
Segmented
images
Cluster
numbers 4 4 3 2
The table 1 shows that the test images and their segmented images through Fuzzy c-means whose cluster numbers are calculated from
the proposed algorithm. The figure also shows the calculated cluster number.
To quantitatively analyse the results of the proposed method, the size of a compressed or segmented image and the original image is
given in Table 2
Table 2 Result Analysis Shows the Compressed Size of Images after Segmentation.
Result analysis Image 1 Image 2 Image 3 Image 4
Original size (kb) 52.9 73.2 68.8 68.3
Calculated 22.5 44.7 36.3 33.0
Cluster-1
Segmented Calculated 20.5 25.6 23.2 23.0
image size (kb) Cluster
Calculated 50.8 52.2 55.2 46.3
Cluster+1
Calculated Cluster-1 57.5 38.9 47.2 51.7
Calculated 61.2 65.0 66.3 66.4
Compression Cluster
percentage Calculated 3.96 28.6 31.4 32.2
Cluster+1
The table 2 shows that the proposed method gives good VI. CONCLUSIONS
results in terms of segmented image as well as a compressed
image. The comparative results are shown in the table which The cluster number of an image is calculated using PCA
includes the size of segmented image using calculated cluster applied to the covariance matrix. PCA is a dimension reducing
number (represented in bold characters), one less than the algorithm which done the cluster number calculation task very
calculated cluster number and one greater than the calculated efficiently. The image obtained after segmentation using the
cluster number. The results show that the size of the segmented calculated cluster number is more compressed and small in size
image using cluster number obtained from proposed algorithm is than the other segmented images. Hence, the results show that
less than the other cluster numbers. It is also found that the the cluster number obtained from the proposed method gives
compression percentage of all the segmented test images using more accurate results than the other cluster numbers. The cluster
the proposed method is above 50 percent, which is considered to number computation is a significant problem in the automation
be useful to reduce the size of the image. Hence, it is found that of the FCM algorithm, which is fulfilled by the proposed
the proposed method gives good segmented image and has less algorithm. There are other problems in FCM which can be
size than the original image. considered for further research.