
Journal of Computer Applications (JCA)

ISSN: 0974-1925, Volume IV, Issue 2, 2011


Refinement of K-Means Clustering Using Genetic Algorithm

K. Arun Prabha a,*, Assistant Professor, Department of Computer Science, Vellalar College for Women (Autonomous), Erode, India (Email: [email protected])
R. Saranya b,1, Research Scholar, Department of Computer Science, Vellalar College for Women (Autonomous), Erode, India (Email: [email protected])

Abstract--- K-means clustering is a popular clustering algorithm based on partitioning the data. However, it has some shortcomings: it requires the user to specify the number of clusters in advance, it is sensitive to initial conditions, and it can only find linearly separable clusters. There are many variations of the k-means clustering algorithm. Kernel k-means is an extension of the standard k-means algorithm that addresses the linear-separability limitation. Recent attempts have adapted the k-means clustering algorithm as well as genetic algorithms based on rough sets to find interval sets of clusters. An important point is that, so far, researchers have not contributed to improving the cluster quality once the data are clustered. In this paper, we propose a new approach to improve the quality of the clusters obtained from k-means clustering using a Genetic Algorithm (GA). The performance is analyzed and compared with standard and kernel k-means clustering in the medical domain.
Index Terms--- K-means, Kernel K-means, Genetic Algorithm.
I. INTRODUCTION
Clustering techniques have become very popular in a number
of areas, such as engineering, medicine, biology and data
mining [1,2]. A good survey on clustering algorithms can be
found in [3]. The k-means algorithm [4] is one of the most
widely used clustering algorithms. The algorithm partitions
the data points (objects) into C groups (clusters), so as to
minimize the sum of the (squared) distances between the data
points and the center (mean) of the clusters. In spite of its simplicity, the k-means algorithm involves a very large number of nearest neighbor queries. The high time complexity of the k-means algorithm makes it impractical when the data set contains a large number of points. Reducing the large number of nearest neighbor queries in the algorithm can accelerate it. In addition, the number of distance calculations increases exponentially with the dimensionality of the data [5-7].
Many algorithms have been proposed to accelerate k-means. In [5,6], the use of kd-trees [8] is suggested to accelerate k-means; however, backtracking is required, which increases the computational complexity [7], and kd-trees are not efficient in higher dimensions. Furthermore, an exact nearest neighbor match is not guaranteed unless some extra search is done, as discussed in [9]. Elkan [10] suggests using the triangle inequality to accelerate k-means. In [11], the use of R-trees is suggested; nevertheless, R-trees may not be appropriate for higher-dimensional problems. In [12-14], the Partial Distance (PD) algorithm has been proposed, which allows early termination of the distance calculation by introducing a premature exit condition into the search process. Recently, kernel k-means [15] has been proposed as an extension of the standard k-means algorithm that maps data points from the input space to a feature space through a nonlinear transformation and minimizes the clustering error in feature space. Thus, nonlinearly separable clusters in input space can be found, overcoming the second limitation of k-means.
As seen in the literature, researchers have contributed only to accelerating the algorithm; there has been no contribution to cluster refinement. In this study, we propose a new algorithm in which a Genetic Algorithm (GA) is applied to refine the clusters produced by k-means in order to improve their quality.
The paper is organized as follows: Section 2 presents the general k-means algorithm, Section 3 presents kernel k-means clustering, and Section 4 discusses the proposed cluster refinement algorithm based on a genetic algorithm. Section 5 presents the results, and the work is concluded in Section 6.
II. STANDARD K-MEANS CLUSTERING
One of the most popular clustering techniques is the k-means
clustering algorithm. Starting from a random partitioning, the
algorithm repeatedly (i) computes the current cluster centers
(i.e. the average vector of each cluster in data space) and (ii)
reassigns each data item to the cluster whose centre is closest
to it. It terminates when no more reassignments take place.
By this means, the intra-cluster variance, that is, the sum of
squares of the differences between data items and their
associated cluster centers, is locally minimized. The strengths of k-means are its runtime, which is linear in the number of data elements, and its ease of implementation. However, the algorithm tends to get stuck in suboptimal solutions (dependent on the initial partitioning and the data ordering), and it works well only for spherically shaped clusters. It requires the number of clusters to be provided or to be determined (semi-)automatically. In our experiments, we run k-means with the correct cluster number. The algorithm is given below, followed by an illustrative sketch.
1. Choose the number of clusters k.
2. Initialize the cluster centers m_1, ..., m_k:
   a. either pick k data points and set the cluster centers to these points,
   b. or randomly assign points to clusters and take the means of the clusters.
3. For each data point, compute the cluster center it is closest to (using some distance measure) and assign the data point to this cluster.
4. Re-compute the cluster centers (mean of the data points in each cluster).
5. Stop when there are no new re-assignments.
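As an illustrative sketch only (not the implementation used in our experiments), the steps above can be written in a few lines of NumPy; the function name and default parameters are ours:

```python
import numpy as np

def kmeans(X, k, max_iter=100, seed=0):
    """Standard k-means on an (N, d) data matrix X with k clusters."""
    rng = np.random.default_rng(seed)
    # Step 2a: pick k distinct data points as the initial cluster centers.
    centers = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    labels = None
    for _ in range(max_iter):
        # Step 3: assign each point to the closest center (squared Euclidean distance).
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        new_labels = dists.argmin(axis=1)
        # Step 5: stop when there are no new re-assignments.
        if labels is not None and np.array_equal(new_labels, labels):
            break
        labels = new_labels
        # Step 4: re-compute each center as the mean of the points assigned to it.
        for j in range(k):
            if np.any(labels == j):
                centers[j] = X[labels == j].mean(axis=0)
    return labels, centers
```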
III. KERNEL K-MEANS CLUSTERING
Kernel k-means [15] is a generalization of the standard k-means algorithm in which data points are mapped from input space to a higher-dimensional feature space through a nonlinear transformation φ, and then k-means is applied in feature space. This results in linear separators in feature space, which correspond to nonlinear separators in input space. Thus, kernel k-means avoids the restriction to linearly separable clusters in input space that k-means suffers from. The objective function that kernel k-means tries to minimize is the clustering error in feature space. We can define a kernel matrix K ∈ R^{N×N}, where K_{ij} = φ(x_i)^T φ(x_j). Any positive-semidefinite matrix can be used as a kernel matrix. Notice that in this case the cluster centers m_k in feature space cannot be calculated explicitly. Usually, a kernel function K(x_i, x_j) is used to directly provide the inner products in feature space without explicitly defining the transformation φ (for certain kernel functions the corresponding transformation is intractable); hence K_{ij} = K(x_i, x_j). Some kernel function examples are given in Table 1. Kernel k-means is described in the following algorithm.
Input: kernel matrix K, number of clusters k, initial clusters C_1, ..., C_k
Output: final clusters C_1, ..., C_k with clustering error E
a. For all points x_n, n = 1, ..., N, do
   i. For all clusters C_i, i = 1 to k, compute ||φ(x_n) - m_i||^2
   ii. End
   iii. Find c*(x_n) = arg min_i ||φ(x_n) - m_i||^2
b. End for
c. For all clusters C_i, i = 1 to k, do
   Update cluster C_i = { x_n : c*(x_n) = i }
d. End
e. If converged, then
   Return final clusters C_1, ..., C_k and the error E
f. Else
   Go to Step (a)
g. End if
Table 1. Examples of Kernel Functions

Polynomial Kernel:  K(x_i, x_j) = (x_i^T x_j + γ)^d
Gaussian Kernel:    K(x_i, x_j) = exp(-||x_i - x_j||^2 / (2σ^2))
Sigmoid Kernel:     K(x_i, x_j) = tanh(γ (x_i^T x_j) + θ)
It can be shown that kernel k-means monotonically
converges if the kernel matrix is positive semidefinite, i.e., is
a valid kernel matrix. If the kernel matrix is not positive
semidefinite, the algorithm may still converge, but this is not
guaranteed.
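For illustration, here is a minimal sketch of kernel k-means driven entirely by the kernel matrix (not the authors' implementation; names and defaults are ours). Since the centers m_i cannot be computed explicitly, the distance ||φ(x_n) - m_i||^2 is expanded through kernel values; the Gaussian kernel from Table 1 is included:

```python
import numpy as np

def gaussian_kernel(X, sigma=1.0):
    """Gaussian kernel matrix from Table 1: K_ij = exp(-||x_i - x_j||^2 / (2 sigma^2))."""
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq / (2.0 * sigma ** 2))

def kernel_kmeans(K, k, max_iter=100, seed=0):
    """Kernel k-means using only the kernel matrix K; cluster centers stay implicit."""
    rng = np.random.default_rng(seed)
    n = K.shape[0]
    labels = rng.integers(0, k, size=n)  # random initial clustering
    for _ in range(max_iter):
        dists = np.zeros((n, k))
        for i in range(k):
            members = labels == i
            size = max(members.sum(), 1)
            # ||phi(x_n) - m_i||^2 = K_nn - (2/|C_i|) sum_j K_nj + (1/|C_i|^2) sum_jl K_jl
            dists[:, i] = (np.diag(K)
                           - 2.0 * K[:, members].sum(axis=1) / size
                           + K[np.ix_(members, members)].sum() / size ** 2)
        new_labels = dists.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels
```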
IV. GENETIC ALGORITHM BASED REFINEMENT
Genetic algorithms (GAs) [16] are randomized search and optimization techniques guided by the principles of evolution and natural genetics, and they exhibit a large amount of implicit parallelism. GAs perform search in complex, large, and multimodal landscapes and provide near-optimal solutions for the objective or fitness function of an optimization problem. In a GA, the parameters of the search space are encoded in the form of strings (called chromosomes). A collection of such strings is called a population. Initially, a random population is created, which represents different points in the search space. An objective or fitness function, which represents the degree of goodness of a string, is associated with each string. Based on the principle of survival of the fittest, a few of the strings are selected, and each is assigned a number of copies that go into the mating pool. Biologically inspired operators such as crossover and mutation are applied to these strings to yield a new generation of strings. The process of selection, crossover, and mutation continues for a fixed number of generations or until a termination condition is satisfied. An excellent survey of GAs, along with the programming structures used, can be found in [17]. GAs have applications in fields as diverse as VLSI design, image processing, neural networks, machine learning, job shop scheduling, etc.
The basic reason for our refinement is that, in any clustering algorithm, the obtained clusters will never have 100% quality. There will be some errors, known as mis-clusterings; that is, a data item can be wrongly clustered. These kinds of errors can be reduced by using our refinement algorithm.
The clusters obtained from kernel k-means clustering are given as input to our refinement algorithm. Initially, a random point is selected from each cluster, and from these points a chromosome is built. In this way an initial population of 10 chromosomes is built. For each chromosome the entropy is calculated as the fitness value, and the global minimum is extracted. Starting from this initial population, the genetic operators reproduction, crossover, and mutation are applied to produce a new population. While the crossover operator is applied, the cluster points get shuffled, meaning that a point can move from one cluster to another. From this new population, the local minimum fitness value is calculated and compared with the global minimum. If the local minimum is less than the global minimum, the global minimum is assigned the local minimum, and the next iteration continues with the new population. Otherwise, the next iteration continues with the old population. This process is repeated for N iterations.
A. String Representation
Here the chromosomes are encoded with real numbers; the number of genes in each chromosome is equal to the number of clusters, and each gene holds a 5-digit vector index. For example, if the data set contains 5 clusters, a sample chromosome may look as follows:

00100 10010 00256 01875 00098

Here each gene gives the index of the point selected from the corresponding cluster; for instance, the gene 00098 refers to the 98th instance and the gene 01875 to the 1875th instance. Once the initial population is generated, we are ready to apply the genetic operators; a small encoding sketch is given below.
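A minimal sketch of this encoding (function names are ours, not from the paper):

```python
import random

def build_chromosome(clusters):
    """clusters: list of lists of instance indices, one list per cluster.
    Returns a chromosome with one 5-digit gene per cluster."""
    return [f"{random.choice(c):05d}" for c in clusters]

def build_population(clusters, size=10):
    """Initial population of 10 chromosomes, as described in the text."""
    return [build_chromosome(clusters) for _ in range(size)]

# Example: 5 clusters -> a chromosome such as ['00100', '10010', '00256', '01875', '00098']
```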
B. Reproduction (selection)
The selection process selects chromosomes from the mating pool directed by the survival-of-the-fittest concept of natural genetic systems. In the proportional selection strategy adopted in this article, a chromosome is assigned a number of copies proportional to its fitness in the population, and these copies go into the mating pool for further genetic operations.
Roulette wheel selection is one common technique that
implements the proportional selection strategy.
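A hedged sketch of roulette wheel selection follows (not the authors' code). Since the fitness here is entropy and lower entropy is better, we assume the scores passed in have already been inverted so that larger means fitter:

```python
import random

def roulette_wheel_select(population, fitness):
    """Pick one chromosome with probability proportional to its fitness.
    fitness: list of non-negative scores where larger is better."""
    total = sum(fitness)
    pick = random.uniform(0.0, total)
    running = 0.0
    for chrom, fit in zip(population, fitness):
        running += fit
        if running >= pick:
            return chrom
    return population[-1]  # numerical safety fallback
```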
C. Crossover
Crossover is a probabilistic process that exchanges information between two parent chromosomes to generate two child chromosomes. In this paper, single-point crossover with a fixed crossover probability p_c is used. For chromosomes of length l, a random integer called the crossover point is generated in the range [1, l-1]. The portions of the chromosomes lying to the right of the crossover point are exchanged to produce two offspring.
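A minimal sketch of single-point crossover on the gene lists defined above; the value p_c = 0.8 is an assumption, since the paper fixes p_c without stating it (it assumes chromosomes of at least two genes):

```python
import random

def single_point_crossover(parent1, parent2, p_c=0.8):
    """Exchange the gene tails to the right of a random crossover point."""
    if random.random() >= p_c:
        return parent1[:], parent2[:]  # no crossover this time
    l = len(parent1)
    point = random.randint(1, l - 1)   # crossover point in [1, l-1]
    child1 = parent1[:point] + parent2[point:]
    child2 = parent2[:point] + parent1[point:]
    return child1, child2
```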
D. Mutation
Each chromosome undergoes mutation with a fixed probability p_m. For a binary representation of chromosomes, a bit position (or gene) is mutated by simply flipping its value. Since we are considering real numbers in this paper, a random position is chosen in the chromosome and replaced by a random digit between 0 and 9.
After the genetic operators are applied, the local minimum fitness value is calculated and compared with the global minimum. If the local minimum is less than the global minimum, the global minimum is assigned the local minimum, the cluster points are repositioned according to the chromosome attaining the global minimum, and the next iteration continues with the new population. Otherwise, the next iteration continues with the old population. This process is repeated for N iterations. In the following section, it is shown that our refinement algorithm improves the cluster quality. The algorithm is given below; a sketch of the refinement loop follows the listing.
1. Choose the number of clusters k.
2. Initialize the cluster centers m_1, ..., m_k based on the mode.
3. For each data point, compute the cluster center it is closest to (using some distance measure) and assign the data point to this cluster.
4. Re-compute the cluster centers (mean of the data points in each cluster).
5. Stop when there are no new re-assignments.
6. GA-based refinement:
   a. Construct the initial population (p1).
   b. Calculate the global minimum fitness (Gmin).
   c. For i = 1 to N do
      i. Perform reproduction (selection).
      ii. Apply the crossover operator between each pair of parents.
      iii. Perform mutation to obtain the new population (p2).
      iv. Calculate the local minimum fitness (Lmin).
      v. If Lmin < Gmin, then Gmin = Lmin and p1 = p2.
   d. Repeat until the N iterations are complete.
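Putting the pieces together, the following sketch of the refinement loop (steps 6a-6d) reuses the selection, crossover, and mutation sketches above. Here entropy_of is a hypothetical helper that returns the entropy of the clustering encoded by a chromosome (lower is better); all names and defaults are ours:

```python
def refine(population, entropy_of, n_iterations=100, p_c=0.8, p_m=0.1):
    """GA-based refinement loop: keep the new population only if its best
    (lowest) entropy improves on the global minimum seen so far."""
    p1 = population
    g_min = min(entropy_of(c) for c in p1)  # step 6b: global minimum
    for _ in range(n_iterations):           # step 6c
        # Convert entropies to scores where larger means fitter.
        scores = [1.0 / (entropy_of(c) + 1e-9) for c in p1]
        p2 = []
        while len(p2) < len(p1):
            a = roulette_wheel_select(p1, scores)       # reproduction
            b = roulette_wheel_select(p1, scores)
            c1, c2 = single_point_crossover(a, b, p_c)  # crossover
            p2.extend([mutate(c1, p_m), mutate(c2, p_m)])  # mutation
        p2 = p2[:len(p1)]
        l_min = min(entropy_of(c) for c in p2)  # step 6c.iv: local minimum
        if l_min < g_min:                       # step 6c.v: keep better population
            g_min = l_min
            p1 = p2
    return p1, g_min
```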
V. EXPERIMENTS & RESULTS
For clustering, two measures of cluster goodness or quality
are used. One type of measure allows us to compare different
sets of clusters without reference to external knowledge and
is called an internal quality measure. The other type of measure lets us evaluate how well the clustering is working
by comparing the groups produced by the clustering
techniques to known classes. This type of measure is called
an external quality measure. One external measure is entropy
[18], which provides a measure of goodness for un-nested
clusters or for the clusters at one level of a hierarchical
clustering. Another external measure is the F-measure,
which, as we use it here, is more oriented toward measuring
the effectiveness of a hierarchical clustering. The F measure
has a long history, but was recently extended to data item
hierarchies in [19].
Entropy
We use entropy as a measure of the quality of the clusters (with the caveat that the best entropy is obtained when each cluster contains exactly one data point). Let CS be a clustering solution. For each cluster, the class distribution of the data is calculated first, i.e., for cluster j we compute p_{ij}, the probability that a member of cluster j belongs to class i. Then, using this class distribution, the entropy of each cluster j is calculated using the standard formula

E_j = -\sum_i p_{ij} \log(p_{ij})

where the sum is taken over all classes. The total entropy for a set of clusters is calculated as the sum of the entropies of each cluster weighted by the size of each cluster:

E_{CS} = \sum_{j=1}^{m} \frac{n_j}{n} E_j

where n_j is the size of cluster j, m is the number of clusters, and n is the total number of data points.
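A small sketch of how E_CS can be computed (assuming cluster and class labels are non-negative integer ids; names are ours):

```python
import numpy as np

def total_entropy(labels, classes):
    """Weighted entropy E_CS of a clustering: labels are cluster ids,
    classes are the known class ids, both length-n integer arrays."""
    labels, classes = np.asarray(labels), np.asarray(classes)
    n, e_total = len(labels), 0.0
    for j in np.unique(labels):
        members = classes[labels == j]
        p = np.bincount(members) / len(members)  # class distribution p_ij in cluster j
        p = p[p > 0]                             # skip empty classes (0 log 0 = 0)
        e_j = -(p * np.log(p)).sum()             # E_j = -sum_i p_ij log p_ij
        e_total += (len(members) / n) * e_j      # weight by cluster size n_j / n
    return e_total
```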
F measure
The second external quality measure is the F measure [19], a measure that combines the precision and recall ideas from information retrieval [20]. We treat each cluster as if it were the result of a query and each class as if it were the desired set of data items for the query. We then calculate the recall and precision of that cluster for each given class. More specifically, for cluster j and class i,

Recall(i, j) = n_{ij} / n_i
Precision(i, j) = n_{ij} / n_j

where n_{ij} is the number of members of class i in cluster j, n_j is the number of members of cluster j, and n_i is the number of members of class i. The F measure of cluster j and class i is then given by

F(i, j) = (2 * Recall(i, j) * Precision(i, j)) / (Precision(i, j) + Recall(i, j))
For an entire hierarchical clustering, the F measure of any class is the maximum value it attains at any node in the tree, and an overall value for the F measure is computed by taking the weighted average of all values for the F measure, as given by the following:

F = \sum_i \frac{n_i}{n} \max_j \{ F(i, j) \}

where the max is taken over all clusters at all levels, and n is the number of data items.
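A sketch for computing the overall F measure on a flat clustering (the formula above ranges over all nodes of a hierarchy; for un-nested clusters a single level suffices):

```python
import numpy as np

def overall_f_measure(labels, classes):
    """Overall F measure: weighted average over classes of the best F(i, j)."""
    labels, classes = np.asarray(labels), np.asarray(classes)
    n, total = len(labels), 0.0
    for i in np.unique(classes):
        n_i = (classes == i).sum()
        best = 0.0
        for j in np.unique(labels):
            n_j = (labels == j).sum()
            n_ij = ((classes == i) & (labels == j)).sum()
            if n_ij == 0:
                continue
            recall, precision = n_ij / n_i, n_ij / n_j
            f = 2 * recall * precision / (recall + precision)
            best = max(best, f)
        total += (n_i / n) * best  # F = sum_i (n_i / n) max_j F(i, j)
    return total
```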
The following table presents the results, which show that our proposed method outperforms the standard methods.
Table 2. Performance Analysis of Cluster Quality

                  Wisconsin Breast Cancer Dataset      |  Dermatology Dataset
                  K-Means  Kernel    Refined K-Means   |  K-Means  Kernel    Refined K-Means
                           K-Means   with GA           |           K-Means   with GA
No. of Classes    2        2         2                 |  6        6         6
No. of Clusters   2        2         2                 |  6        6         6
Entropy           0.3637   0.2373    0.1502            |  0.1826   0.0868    0.0103
F-measure         0.9125   0.9599    0.9799            |  0.8303   0.8537    0.8841
VI. CONCLUSION
In this paper, we have proposed a new framework to improve the quality of the clusters obtained from k-means clustering using a genetic algorithm. The proposed algorithm is tested in the medical domain, and the results show that refined initial starting points and post-processing refinement of the clusters indeed lead to improved solutions. The method is scalable and can be coupled with a scalable clustering algorithm to address large-scale clustering problems in data mining. Experimental results show that the proposed algorithm achieves better results than the conventional and kernel k-means algorithms when applied to real data sets.
REFERENCES
[1] Lv T., Huang S., Zhang X., and Wang Z,
Combining Multiple Clustering Methods Based on
Core Group. Proceedings of the Second
International Conference on Semantics, Knowledge
and Grid (SKG06), pp: 29-29, 2006.
[2] Nock R., and Nielsen F., On Weighting Clustering.
IEEE Transactions and Pattern Analysis and
Machine Intelligence, 28(8): 1223-1235, 2006.
[3] Xu R., and Wunsch D., Survey of clustering
algorithms. IEEE Trans. Neural Networks, 16 (3):
645-678, 2005.
[4] MacQueen J., Some methods for classification and
analysis of multivariate observations. Proc. 5th
Berkeley Symp. Math. Stat. and Prob, pp: 281-97,
1967.
[5] Kanungo T., Mount D.M., Netanyahu N., Piatko C.,
Silverman R., and Wu A.Y., An efficient k-means
clustering algorithm: Analysis and implementation.
IEEE Trans. Pattern Analysis and Machine
Intelligence, 24 (7): 881-892, 2002.
[6] Pelleg D., and Moore A., Accelerating exact
k-means algorithm with geometric reasoning.
Proceedings of the fifth ACM SIGKDD
International Conference on Knowledge Discovery
and Data Mining, New York, pp. 727-734, 1999.
[7] Sproull R., Refinements to Nearest-Neighbor
Searching in K-Dimensional Trees. Algorithmica,
6: 579-589, 1991.
[8] Bentley J., Multidimensional Binary Search Trees
Used for Associative Searching. Commun. ACM,
18 (9): 509-517, 1975.
[9] Friedman J., Bentley J., and Finkel R., An
Algorithm for Finding Best Matches in Logarithmic
Expected Time. ACM Trans. Math. Soft. 3 (2):
209-226, 1977.
[10] Elkan, C., Using the Triangle Inequality to
Accelerate k-Means. Proceedings of the Twentieth
International Conference on Machine Learning
(ICML-2003), pp. 609-616, 2003.
[11] Hjaltason G. R., and Samet H., Distance Browsing in Spatial Databases. ACM Transactions on Database Systems, 24 (2): 26-42, 1999.
[12] Proietti G., and Faloutsos C., Analysis of Range Queries and Self-spatial Join Queries on Real Region Datasets Stored using an R-tree. IEEE Transactions on Knowledge and Data Engineering, 12 (5): 751-762, 2000.
[13] Cheng D.-Y., Gersho A., Ramamurthi B., and Shoham Y., Fast Search Algorithms for Vector Quantization and Pattern Recognition. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 1, pp: 1-9, 1984.
[14] Bei C., and Gray, R., An Improvement of the
Minimum Distortion Encoding Algorithm for
Vector Quantization. IEEE Transactions on
Communications, 33 (10): 1132-1133, 1985.
[15] Schölkopf B., Smola A., and Müller K.-R., Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput., 10(5): 1299-1319, 1998.
[16] Davis L. (Ed.), Handbook of Genetic Algorithms, Van Nostrand Reinhold, New York, 1991.
[17] Michalewicz Z., Genetic Algorithms + Data Structures = Evolution Programs, Springer, New York, 1992.
[18] Shannon CE., A mathematical theory of
communication, Bell System Technical Journal,
27:379-423 and 623-656, July and October, 1948.
[19] Kowalski G., Information Retrieval Systems: Theory and Implementation, Kluwer Academic Publishers, 1997.
[20] Larsen B., and Aone C. Fast and Effective Text
Mining Using Linear-time Document Clustering,
KDD-99, San Diego, California, 1999.
BIOGRAPHY
Ms. K. Arun Prabha, M.C.A., M.Phil., is currently working as an Assistant Professor in the Department of Computer Science, Vellalar College for Women (Autonomous), Erode. She has 14 years of teaching experience and 3 years of research experience. She has published 4 papers in national/international journals and conferences and has also presented 5 papers at national/international conferences. Her areas of interest include Data Mining and Soft Computing.
R. Saranya received her Bachelor's degree (B.C.A.) in Computer Applications and her Master's degree (M.Sc.) in Computer Science from Vysya College, Periyar University, Salem. She is currently pursuing her M.Phil. research at Vellalar College for Women (Autonomous), Erode. Her areas of interest include Data Mining and Soft Computing.
