An Efficient Fuzzy Clustering Algorithm
D. Vanisri, Department of Computer Technology and Applications, Kongu Engineering College, Tamilnadu, INDIA
Dr. C. Loganathan, Principal, Maharaja Arts and Science College, Coimbatore, Tamilnadu, INDIA
Abstract
The fuzzy K-means clustering algorithm is very useful for exploring the structure of a set of patterns, especially
when the clusters are overlapping. The K-means algorithm is simple, has low time complexity, and can process large
data sets quickly. However, the conventional K-means algorithm cannot achieve a high clustering precision rate and is
easily affected by randomly initialized cluster centers and by isolated points. This paper proposes an algorithm to compute
initial cluster centers for K-means clustering. A cutting plane is used to partition the data in a cell, dividing the cell
into two smaller cells. The plane is perpendicular to the data axis with the highest variance and is intended to reduce the
sum of squared errors of the two cells while at the same time keeping the two cells apart. Cells are partitioned one at a time
until the number of cells equals the predefined number of clusters, K. The centers of the K cells become the initial
cluster centers for K-means. The experimental results suggest that the proposed algorithm is effective and converges to
better clustering results than those of the random initialization method. The results also indicate that the proposed
algorithm greatly improves the likelihood of every cluster containing some data.
1. Introduction
Clustering is the task of dividing data points into homogeneous classes or clusters so that items in the same class are
as similar as possible and items in different classes are as dissimilar as possible. Clustering is a form of data
compression, where a large number of samples are converted into a small number of representative prototypes or
clusters. An ideal clustering algorithm classifies data such that samples that belong to a cluster are close to each
other while samples from different clusters are further away from each other.
In non-fuzzy or hard clustering, data are partitioned into crisp clusters, where each data point belongs to exactly
one cluster. In fuzzy clustering, by contrast, data points can belong to more than one cluster, and associated with
each point are membership grades that indicate the degree to which the data point belongs to the different clusters.
In real applications fuzzy clustering is often best suited, as there is frequently no sharp boundary between clusters
in the data. In fuzzy clustering, membership degrees between zero and one are used instead of crisp assignments of the
data to clusters.
There are various clustering algorithms. K-means is a well-known and effective algorithm: given a number of clusters,
it iterates to find the best clusters for the objects. Although the K-means algorithm is simple and can be used for a
wide variety of data types, it is highly sensitive to the initial positions of the cluster centers.
The main aim of this paper is to enhance the K-means algorithm with initial cluster centers derived from data
partitioning along the data axis with the highest variance; the centers of the K cells become the initial cluster
centers for K-means. This method reduces the effects on the clustering algorithm caused by the shape of the data
distribution, the input order of the data, and changes of parameters, and makes the clustering results less likely to
fall into local optima. Thus this paper proposes an enhanced K-means algorithm which can produce better results than
the conventional K-means.
2. Related Work
(N. Vlassis et al., 2003) proposed the global k-means clustering algorithm, which constructs initial centers by
recursively partitioning the data space into disjoint subspaces using a k-d tree method. The cutting hyperplane used in
the method is defined as the plane perpendicular to the highest-variance axis derived by principal component
analysis. The partitioning is performed until each of the leaf nodes (buckets) contains less than a predefined number
of data instances (the bucket size) or the predefined number of buckets has been created. The centroids of the data in
the final buckets are then used as initial centers for K-means.
(Giovanna Castellano, 2003) proposed an approach for the automatic discovery of transparent
diagnostic rules from data. The approach relies on a fuzzy clustering technique defined by three sequential
steps. First, the Crisp Double Clustering algorithm is applied to the available symptom measurements to provide a
set of representative multidimensional prototypes, which are further clustered onto each one-dimensional projection.
The resulting clusters are used in the second step, in which a set of fuzzy relations is defined in terms of
transparent fuzzy sets. In the final step, the derived fuzzy relations are used to define a set of fuzzy rules, which
constitute the knowledge base of a fuzzy inference system that can be used for fuzzy diagnosis. The experiments were
performed on the Aachen Aphasia data set as a real-world benchmark and compared with related work.
Fuzzy logic formalizes an intuitive theory based on human approximate reasoning. It differs from traditional logic
methods, where crisp or exact results are expected. Fuzzy logic is used in problems where the results can be
approximate rather than exact; hence, its principles suit clustering problems well. The results are determined by some
degree of closeness to true or to false, and clustering problems generally measure some kind of closeness between
similar objects. Fuzzy-logic-based approaches have been applied in various fields, including clustering (U. Maulik, 2000),
to provide flexibility to classical algorithms, owing to their applicability to problems that do not require hard solutions.
K-means is one of the most widely used clustering algorithms and has been applied in various fields of science and
technology. A major drawback of the k-means algorithm is that it can produce empty clusters depending on the initial
center vectors. This problem is considered insignificant for static execution of k-means and can easily be
addressed by executing the algorithm several times. But when k-means is used as an integral part of some
higher-level application, this empty cluster problem may produce irregular behavior of the system and may lead to
significant performance degradation. (R. Dubes, A. Jain, 1998) present a modified version of the k-means
algorithm that efficiently eliminates this empty cluster problem. Based on the experimental results of that
algorithm, it is observed that there is no performance degradation due to the incorporated modification.
It is very difficult to build a perfect classifier with 100% prediction accuracy because of the complexity of
biomedical classification problems. Hence it is better to build an effective Decision Support System (DSS), which
should not only predict unseen samples accurately, but also work in a human-understandable way. (Yuanchen
He et al., 2006) proposed a novel adaptive Fuzzy Association Rules (FARs) mining algorithm, named FARM-DS, to
build such a DSS for binary classification problems in the biomedical domain. Four steps are executed to mine
FARs in the training phase, and the mined rules are thereafter used to predict unseen samples in the testing phase. The
new FARM-DS algorithm was evaluated on two publicly available medical data sets. The experimental results show that
FARM-DS performs well in terms of prediction accuracy. Moreover, the mined FARs provide strong decision support for
disease diagnosis owing to their easy interpretability.
3. Methodology
The K-means clustering algorithm is one of the simplest methods of clustering data. It is widely used owing to its
simplicity and convenience. The K-means algorithm can be described as follows:
1. Randomly select K instances as the initial cluster centers.
2. Repeat steps 3 and 4 until the cluster centers no longer change.
3. Compute the distance of each instance to every cluster center, and assign the instance to the cluster whose
center is nearest.
4. Recompute each cluster center as the mean of the instances assigned to it.
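To make these steps concrete, the following is a minimal sketch of the standard K-means loop. It is an illustration only, not code from the paper: NumPy is assumed, and the function name `kmeans` and its parameters are ours.

```python
import numpy as np

def kmeans(data, k, max_iter=100, seed=0):
    """Minimal K-means: random initial centers, assign, update, repeat."""
    data = np.asarray(data, dtype=float)
    rng = np.random.default_rng(seed)
    # Step 1: randomly select K instances as the initial cluster centers.
    centers = data[rng.choice(len(data), size=k, replace=False)].copy()
    labels = np.full(len(data), -1)
    for _ in range(max_iter):
        # Step 3: assign each instance to the cluster whose center is nearest.
        dists = np.linalg.norm(data[:, None, :] - centers[None, :, :], axis=2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):   # converged: assignments unchanged
            break
        labels = new_labels
        # Step 4: recompute each center as the mean of its assigned instances.
        for j in range(k):
            members = data[labels == j]
            if len(members) > 0:                 # an empty cluster keeps its old center
                centers[j] = members.mean(axis=0)
    return centers, labels
```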
There are certain disadvantages in the K-means algorithm:
1. K-means can only be used when the mean value of a cluster is defined. This may not suit some applications, such
as clustering of mobile objects or data with categorical attributes.
2. K must be given by the user. The algorithm is also sensitive to the initial values and can lead to different
clustering results with different initial values.
3. K-means is not suited to non-convex clusters or to clusters with large differences in size. It is also sensitive
to noise and isolated points; a small amount of such data can strongly affect the mean values.
To overcome these drawbacks, a modified K-means has to be incorporated.
Fig. 1 Diagram of ten data points in 2D, sorted by their X values, with an ordering number for each data point
Consider ten data points in a 2D data space as shown in Fig. 1. The aim is to partition the ten data points into two
disjoint cells such that the sum of the total clustering errors of the two cells is minimal, as represented in Fig. 2.
Consider a cutting plane perpendicular to the X-axis used to partition the data. Let C_1 and C_2 be the first cell and
the second cell, and let c_1 and c_2 be their cell centroids, respectively. The total clustering error of the first
cell is thus computed by:

E_1 = \sum_{i=1}^{|C_1|} d(x_i, c_1)

and the total clustering error of the second cell is thus computed by:

E_2 = \sum_{j=1}^{|C_2|} d(x_j, c_2)

where x_i denotes a data point in a cell and d(\cdot, \cdot) is the Euclidean distance. The cutting plane is chosen so
that the resulting sum of the total clustering errors of both cells is minimal, as represented in Fig. 2.
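As a small illustration of this definition, the following sketch (our own, with NumPy assumed and Euclidean distance) computes the total clustering error of a cell and of a two-cell partition on made-up points:

```python
import numpy as np

def total_clustering_error(cell):
    """Sum of Euclidean distances between each data point in the cell and the cell centroid."""
    cell = np.asarray(cell, dtype=float)
    centroid = cell.mean(axis=0)
    return float(np.linalg.norm(cell - centroid, axis=1).sum())

# Example: error of two cells obtained by a cut perpendicular to the X-axis at x = 5.
points = np.array([[1.0, 2.0], [2.0, 1.5], [2.5, 3.0], [7.0, 8.0], [8.0, 7.5]])
C1, C2 = points[points[:, 0] < 5], points[points[:, 0] >= 5]
print(total_clustering_error(C1) + total_clustering_error(C2))   # sum of the two cell errors
```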
Fig. 2 Diagram of partitioning a cell of ten data points into two smaller cells; a solid line represents the intercluster distance and dash lines
represent the intracluster distances
Fig. 3 Illustration of partitioning the ten data points into two smaller cells using m as the partitioning point. A solid line in the square represents
the distance between the cell centroid and a data point in the cell, a dash line represents the distance between m and a data point in each cell, and a solid dash
line represents the distance between m and the cell centroid of each cell.
The partition can be done using a cutting plane that passes through m. Thus, by the triangle inequality,

\sum_{i=1}^{|C_1|} d(x_i, c_1) \leq \sum_{i=1}^{|C_1|} d(x_i, m) + \sum_{i=1}^{|C_1|} d(m, c_1)

\sum_{j=1}^{|C_2|} d(x_j, c_2) \leq \sum_{j=1}^{|C_2|} d(x_j, m) + \sum_{j=1}^{|C_2|} d(m, c_2)

where

\sum_{i=1}^{|C_1|} d(m, c_1) = |C_1| \cdot d(m, c_1) \quad and \quad \sum_{j=1}^{|C_2|} d(m, c_2) = |C_2| \cdot d(m, c_2),

m is the partitioning data point, and |C_1| and |C_2| are the numbers of data points in clusters C_1 and C_2,
respectively. The total clustering error of the first cell can be minimized by reducing the total discrepancy between
all data in the first cell and m, which is computed by:

\sum_{i=1}^{|C_1|} d(x_i, m)

The same argument holds for the second cell. Its total clustering error is minimized by reducing the total discrepancy
between all data in the second cell and m, computed by:

\sum_{j=1}^{|C_2|} d(x_j, m)

where d(x_i, m) is the distance between m and each data point in a cell. Therefore the problem of minimizing the sum of
the total clustering errors of both cells can be transformed into the problem of minimizing the sum of the distances
from all data in the two cells to m.
The relationship between the total clustering error and the partitioning point is illustrated in Fig. 4, where the
horizontal axis represents the partitioning point, which runs from 1 to n (n being the total number of data points),
and the vertical axis represents the total clustering error. When m = 0, the total clustering error of the second cell
equals the total clustering error of all data points, while the total clustering error of the first cell is zero. On
the other hand, when m = n, the total clustering error of the first cell equals the total clustering error of all data
points, while the total clustering error of the second cell is zero.
Fig. 4 Graphs depict the total clustering error, lines 1 and 2 represent the total clustering error of the first cell and second cell, respectively, Line
3 represents a summation of the total clustering errors of the first and the second cells.
The parabola-like curve shown in Fig. 4 represents the summation of the total clustering errors of the first cell and
the second cell (line 3). Note that the lowest point of the curve corresponds to the optimal partitioning point m; at
this point, the summation of the total clustering errors of the first cell and the second cell is minimal.
Since the time complexity of finding the optimal point m is O(n^2), the distances between adjacent data points along
the X-axis are used instead to find an approximation of m, with time complexity O(n).
Fig. 5 Illustration of ten data points, a solid line represents the distance between adjacent data along the X-axis and a dash line represents the
distance between m and any data point
The task of approximating the optimal point m in 2D is thus replaced by finding m on a one-dimensional line, as shown
in Fig. 6.
Fig. 6 Illustration of the ten data points on a one-dimensional line and the relevant Dj
The point m is therefore the centroid of the data on the one-dimensional line shown in Fig. 6, which yields

\sum_{i=1}^{|C_1|} d(x_i, m) \approx \sum_{j=1}^{|C_2|} d(x_j, m)

Let

D_j = \sum_{i=1}^{j-1} d_i

where d_i is the distance between the i-th pair of adjacent data points along the axis (so D_1 = 0 for the first
point). A centroidDist can then be computed by:

centroidDist = \frac{1}{n} \sum_{j=1}^{n} D_j
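As a small worked example (our own illustrative numbers, using the unsquared adjacent distances and the D_1 = 0 convention above): consider five points on a line at positions 0, 1, 2, 6 and 7. The adjacent distances are d = (1, 1, 4, 1), so D = (0, 1, 2, 6, 7) and centroidDist = (0 + 1 + 2 + 6 + 7)/5 = 3.2. The D_j value closest to 3.2 is D_3 = 2, so the partitioning point m is the third point (position 2), and the data are split into {0, 1, 2} and {6, 7}, which is the intuitively correct partition.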
Therefore, the total clustering errors of the two smaller cells partitioned by the plane passing through the data
point nearest to centroidDist are approximately equal. Either the X-axis or the Y-axis could be chosen as the principal
axis for data partitioning; however, the data axis with the highest variance is chosen as the principal axis. The
reason is to make the inter-cluster distance between the centers of the two cells as large as possible while the sum of
the total clustering errors of the two cells is reduced from that of the original cell. To partition the given data into
k cells, we start with a cell containing all given data and partition it into two cells. We then select the next cell
to be partitioned as the one that yields the largest reduction of total clustering error (or delta clustering error),
where the total clustering error of the original cell is redefined as the sum of the total clustering errors of its two
sub-cells. This is done so that every time a partition is performed, it minimizes the sum of the total clustering
errors over all cells.
Now, the partitioning algorithm is used to partition a given set of data into k cells. The centers of the cells can
then be used as good initial cluster centers for the K-means algorithm. Following are the steps of the proposed
algorithm.
1. Let cell c contain the entire data set.
2. Sort all data in cell c in ascending order on each attribute value and link the data by a linked list for each
attribute.
3. Compute the variance of each attribute of cell c. Choose the attribute axis with the highest variance as the
principal axis for partitioning.
4. Compute the squared Euclidean distances between adjacent data points along the data axis with the highest
variance, and compute the accumulated sums D_j.
5. Compute centroidDist as the average of the D_j values.
6. Choose the data point whose D_j value is closest to centroidDist as the partitioning point m, and split cell c
into two smaller cells by the cutting plane passing through m and perpendicular to the principal axis.
7. Compute the delta clustering error of each cell that can be partitioned, and select the cell that yields the
largest reduction of total clustering error as the next cell c to be partitioned.
8. Repeat steps 2 to 7 until the number of cells equals the predefined number of clusters, K.
9. Use the centroids of the K cells as the initial cluster centers for the K-means algorithm.
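The following Python sketch puts these steps together. It is our own illustration under the interpretation above, not the authors' reference implementation: NumPy is assumed, and names such as `split_cell` and `partition_initial_centers` are ours. It splits the cell whose partition yields the largest error reduction until K cells are obtained and returns their centroids.

```python
import numpy as np

def cell_error(cell):
    """Total clustering error of a cell: sum of Euclidean distances to its centroid."""
    return float(np.linalg.norm(cell - cell.mean(axis=0), axis=1).sum())

def split_cell(cell):
    """Split a cell by a cutting plane perpendicular to its highest-variance axis."""
    axis = int(cell.var(axis=0).argmax())          # attribute axis with the highest variance
    sorted_cell = cell[cell[:, axis].argsort()]    # data sorted along that axis
    # Squared Euclidean distances between adjacent data points, their accumulated
    # sums D_j (with D_1 = 0), and centroidDist as the average of the D_j values.
    d = np.sum(np.diff(sorted_cell, axis=0) ** 2, axis=1)
    D = np.concatenate(([0.0], np.cumsum(d)))
    centroid_dist = D.mean()
    # The data point whose D_j is nearest to centroidDist is the partitioning point m;
    # here m is kept in the first cell, clamped so that neither cell is empty.
    m = int(np.abs(D - centroid_dist).argmin())
    m = min(max(m, 0), len(sorted_cell) - 2)
    return sorted_cell[:m + 1], sorted_cell[m + 1:]

def partition_initial_centers(data, k):
    """Partition the data into k cells and return the cell centroids as initial centers."""
    cells = [np.asarray(data, dtype=float)]
    while len(cells) < k:
        best = None
        # Select the cell whose split yields the largest reduction in total clustering error.
        for i, cell in enumerate(cells):
            if len(cell) < 2:
                continue
            c1, c2 = split_cell(cell)
            gain = cell_error(cell) - (cell_error(c1) + cell_error(c2))
            if best is None or gain > best[0]:
                best = (gain, i, c1, c2)
        if best is None:        # every remaining cell has one point; cannot split further
            break
        _, i, c1, c2 = best
        cells[i:i + 1] = [c1, c2]
    return np.array([cell.mean(axis=0) for cell in cells])
```

The returned centers can then be used to seed K-means in place of random initialization, for example in the `kmeans` sketch shown earlier in this section.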
4. Experimental Results
The proposed algorithm is evaluated on the Iris and Wine data sets from the UCI Machine Learning
Repository (C.L. Blake, C.J. Merz, 1998). The clustering results of the K-means algorithm using random initial
centers and using initial centers derived by the proposed algorithm are compared.
The measurements used for comparing the clustering results are:
1. The sum of squared error distances between the data and the centroids of their clusters (SSE). The SSE
results on the two UCI data sets are shown in Fig. 7.
2. Entropy to measure the impurity of each cluster:

E = - \sum_{j=1}^{c} p_j \log p_j

where c is the number of classes of the data and p_j is the proportion of data of class j in a given cluster.
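A minimal sketch of this entropy measure (our own; NumPy assumed, natural logarithm) is given below; it computes the impurity of one cluster from the class labels of its members:

```python
import numpy as np

def cluster_entropy(class_labels):
    """Entropy of one cluster: -sum_j p_j * log(p_j) over the classes present in it."""
    _, counts = np.unique(class_labels, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log(p)).sum())

# Example: a cluster containing 8 samples of class 0 and 2 samples of class 1.
print(cluster_entropy([0] * 8 + [1] * 2))   # ~0.50 using the natural logarithm
```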
Fig. 7 SSE results on Iris and Wine based on the number of clusters (Iris with 3 clusters, Iris with 6 clusters, Wine with 3 clusters; random initialization vs. proposed approach)
The averaged entropy over all clusters is then used for comparison. The averaged entropy values for Iris and Wine are
shown in Fig. 8. From the two measurements we can see that the proposed algorithm outperforms the random
initialization algorithm in most cases. The proposed algorithm also performs much better than the random
initialization algorithm as the required number of clusters increases. The execution time of K-means using the
proposed algorithm is also much less than the average execution time of K-means using random initialization for the
Iris and Wine data sets. This may be because the initial cluster centers generated by the proposed algorithm are quite
close to the optimal solutions. The execution time comparisons for the two UCI data sets are shown in Fig. 9. The
clustering results of the proposed algorithm are also compared with those of the Cluster Center Initialization
Algorithm (CCIA). Fig. 10 shows the clustering results in terms of classification error (%), where the class of a data
point in a cluster is predicted to be the majority class of the data in that cluster.
It can be seen that the performance of the proposed algorithm is comparable to that of CCIA. However, the proposed
algorithm is much simpler to implement than CCIA.
Fig. 8 Averaged entropy of the clusters for the Iris and Wine data sets (random initialization vs. proposed approach)
Fig. 9 Execution time comparisons for the Iris and Wine data sets (3 and 6 clusters)
Fig. 10 Classification error comparisons among the three methods, Cluster Center Initialization Algorithm (CCIA), Random Initialization
Algorithm and Proposed Algorithm
5. Conclusion
A novel initialization algorithm of cluster centers for K means algorithm has been proposed. The algorithm was
based on the data partitioning algorithm used for color quantization. A data set was partitioned into k clusters in
such a way that the sum of the total clustering errors for all clusters was reduced as much as possible while inter
distances between clusters are maintained to be as large as possible. The proposed algorithm is very effective,
converges to better clustering results and almost all clusters have some data in it. The experimental results show that
the proposed algorithm performs better than random initialization and can reduce running time of K-Means
significantly for iris and wine datasets. The performances of proposed algorithm are also comparable to the CCIA
however the proposed algorithm is much simpler and easier to implement.
References
[1] R. Dubes and A. Jain, “Algorithms for Clustering Data”, Prentice-Hall, Englewood Cliffs, NJ, 1998.
[2] N. Vlassis, A. Likas and J.J. Verbeek, “The Global k-means Clustering algorithm”, Pattern Recognition , Volume 36, Issue 2, pp. 451- 461,
2003.
[3] P.S. Bradley and U.M. Fayyad, “Refining initial points for K-means Clustering”, Proceeding of The Fifteenth International Conference on
Machine Learning, Morgan Kaufmann, San Francisco, CA, 1998, pp. 91-99.
[4] C.L. Blake, C.J. Merz. UCI Repository of machine learning databases. University of California, Irvine, Department of Information and
Computer Science, 1998.
[5] J. Han and M. Kamber, “Data Mining: Concepts and Techniques”, Morgan Kaufmann Publishers, San Diego, 2001.
[6] P. Mitra, C.A. Murthy, S.K. Pal, “Density based multiscale data condensation”, IEEE Trans. Pattern Anal. Machine Intell., 24 (6), pp. 734–747,
2002.
[7] S. S. Khan and A. Ahmad, “Cluster Center Initialization for K-mean Clustering”, Pattern Recognition Letters, Volume 25, Issue 11, pp.
1293-1302, 2004.
[8] Y. Sirisathitkul, S. Auwatanamongkol and B. Uyyanonvara, “Color image quantization using distances between adjacent colors along the
color axis with highest color variance”, Pattern Recognition Letters, Volume 25, Issue 9, pp. 1025-1043, 2004.
[9] Yuanchen He, Yuchun Tang, Yan-Qing Zhang, Rajshekhar Sunderraman, “Adaptive Fuzzy Association Rule mining for effective
decision support in biomedical applications”, International Journal of Data Mining and Bioinformatics Vol. 1, No.1 pp. 3 – 18, 2006.
[10] Pratima Gautam, Neelu Khare, K. R. Pardasani, “A Model for Mining Multilevel Fuzzy Association Rule in Database”, Journal of
Computing, Vol. 2, Issue 1, January 2010.
[11] Giovanna Castellano, Anna M. Fanelli, Corrado Mencar, “A Fuzzy Clustering Approach for Mining Diagnostic Rules”, IEEE, 2003.
[12] W. Pedrycz, J.V. de Oliveira, “Optimization of Fuzzy Models”, IEEE Trans. on Systems, Man and Cybernetics B, vol. 26 No. 4, 1996.
[13] D. Nauck, R. Kruse, “Obtaining Interpretable Fuzzy Classification Rules from Data”, Artificial Intelligence in Medicine, vol. 16, no. 2, pp.
129-147, 1999.
[14] Jim C. Bezdek, “Fuzzy Mathematics in Pattern Classification.” Cornell University, Ithaca, 1973.
[15] S. L. Chiu. “Fuzzy model identification based on cluster estimation” Journal of Intelligent and Fuzzy Systems, 1994.
[16] S. Bandyopadhyay, U. Maulik, and M. K. Pakhira, “Partitional clustering using simulated annealing with probabilistic redistribution,” in
International Journal Pattern Recognition and Artificial Intelligence, vol. 15, pp. 269--285, 2001.
[17] U. Maulik and S. Bandyopadhyay, “Genetic algorithms based clustering technique,” in Pattern Recognition, vol. 33, pp. 1455- 1465, 2000.
Biographical notes:
D. Vanisri received the Master of Science in Mathematics in 2001 from Madurai Kamaraj University and completed her
Master of Philosophy in Mathematics in 2003. She has presented many papers at national and international conferences,
guided many UG projects, and has published a paper in an international journal. She is currently pursuing research in
the field of fuzzy clustering and rule mining at Mother Teresa Women’s University, Kodaikanal, and is working as a
Lecturer in the Department of Computer Technology and Applications, Kongu Engineering College, Tamilnadu.
Dr. C. Loganathan qualified with B.Sc and M.Sc degrees in Mathematics in 1978 and 1980, respectively, from Madras
University, and subsequently with M.Phil and Ph.D degrees in Mathematics from Bharathiar University. He served in
various capacities as a faculty member and Head of the Department of Mathematics at Kongu Engineering College,
Perundurai, for more than a decade. He is at present working as the Principal of Maharaja Arts and Science College,
Coimbatore. His academic work has culminated in the publication of more than 12 research papers in leading refereed
national and international journals. As a research guide, he has produced many M.Phil and Ph.D candidates, and he is a
reviewer for many refereed international journals. His areas of interest encompass Applied Mathematics, Control Theory,
Numerical Methods, Quantitative Techniques and Neural Networks. He has co-authored books on Quantitative Methods in
Management, Engineering Mathematics I and Engineering Mathematics II.