A New Hybrid Clustering Algorithm Based On K-Means and Ant Colony Algorithm
Jue Lu
Rongqiang Hu
School of Information
Wuhan University of Technology
Wuhan, P. R. China
[email protected]
School of Automation
Wuhan University of Technology
Wuhan, P. R. China
[email protected]
I. INTRODUCTION
$$
f(O_i) =
\begin{cases}
\dfrac{1}{r^2} \displaystyle\sum_{O_j \in Neigh(r)} \left[ 1 - \dfrac{d(O_i, O_j)}{\alpha} \right], & \text{if } f(O_i) > 0 \\
0, & \text{otherwise}
\end{cases}
\tag{1}
$$
Here, Neigh(r) denotes the ant's local range: a circular area of
radius r on the grid. d(O_i, O_j) is the distance between objects
O_i and O_j in the attribute space; Euclidean distance and cosine
distance are often used to express the distance between objects.
\alpha is defined as the coefficient of colony similarity.
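The colony similarity computation can be illustrated with a minimal Python sketch. It assumes the common form of the measure, f(O_i) = max(0, (1/r^2) * sum over O_j in Neigh(r) of [1 - d(O_i, O_j)/alpha]), with Euclidean distance. The function names, the parameter name `alpha`, and the sample values are illustrative assumptions, not taken from the paper.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two attribute vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def colony_similarity(obj, neighbours, r, alpha, dist=euclidean):
    """Similarity of `obj` to the objects inside the ant's local range.

    neighbours: attribute vectors of the objects found in Neigh(r)
    r:          radius of the local range on the grid
    alpha:      coefficient of colony similarity (assumed symbol name)
    """
    total = sum(1.0 - dist(obj, other) / alpha for other in neighbours)
    # Negative sums are clipped to 0, matching the "otherwise" branch.
    return max(0.0, total / (r ** 2))


# Hypothetical data: one identical neighbour and one at distance 5.
similar = colony_similarity((0.0, 0.0), [(0.0, 0.0), (3.0, 4.0)], r=2, alpha=10.0)
# A single very distant neighbour gives a negative sum, clipped to 0.
dissimilar = colony_similarity((0.0, 0.0), [(30.0, 40.0)], r=2, alpha=10.0)
```

A larger `alpha` makes the measure more tolerant of distance, so more objects count as similar; a smaller `alpha` has the opposite effect.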
B. Probability transformation function
The probability transformation function converts colony
similarity into the probability that an ant transports an
unclustered object. Its argument is the colony similarity, and
its range is the interval from 0 to 1. It is usually drawn as two
corresponding curves on the coordinate plane: one gives the
probability of pick-up, the other the probability of drop. The
rule of the probability transformation function is as follows:
the greater the colony similarity, the lower the probability of
pick-up; the smaller the colony similarity, the higher the
probability of pick-up. The probability of drop behaves in the
opposite way. According to this principle, we can choose a
sigmoid function as the probability transformation function. The
probability of pick-up and the probability of drop are defined
respectively as follows:
$$P_p = 1 - Sigmoid(f(O_i)) \tag{2}$$
$$P_d = Sigmoid(f(O_i)) \tag{3}$$
Here:
$$Sigmoid(x) = \frac{1 - e^{-cx}}{1 + e^{-cx}}$$
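As a rough illustration of the pick-up and drop rule, the sketch below assumes the sigmoid has the form Sigmoid(x) = (1 - e^(-cx)) / (1 + e^(-cx)), where c is a slope constant. The function names and the value chosen for `c` are hypothetical.

```python
import math


def sigmoid(x, c):
    """Assumed form: (1 - exp(-c*x)) / (1 + exp(-c*x)); lies in [0, 1) for x >= 0."""
    e = math.exp(-c * x)
    return (1.0 - e) / (1.0 + e)


def pick_up_probability(f, c):
    """P_p = 1 - Sigmoid(f): high when the object is unlike its surroundings."""
    return 1.0 - sigmoid(f, c)


def drop_probability(f, c):
    """P_d = Sigmoid(f): high when the object resembles its surroundings."""
    return sigmoid(f, c)


c = 1.0  # hypothetical slope constant
p_pick_isolated = pick_up_probability(0.0, c)  # similarity 0: always pick up
p_drop_isolated = drop_probability(0.0, c)     # similarity 0: never drop
p_drop_similar = drop_probability(1.0, c)      # higher similarity: more likely to drop
```

With this form the two probabilities always sum to 1, and a larger `c` makes the transition from picking up to dropping sharper.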
Proceedings of the 2nd International Conference on Computer Science and Electronics Engineering (ICCSEE 2013)
K-MEANS ALGORITHM
$$E = \sum_{i=1}^{k} \sum_{p \in C_i} \left| p - m_i \right|^2 \tag{4}$$
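The criterion E in Equation (4) can be sketched in Python under the usual K-means reading, which is an assumption here: E is the sum of squared Euclidean distances between each object p in cluster C_i and the mean m_i of that cluster. The helper names and the sample clusters are illustrative only.

```python
def centroid(points):
    """Mean vector m_i of a non-empty cluster given as a list of tuples."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def squared_error(clusters):
    """Sum of squared distances from every object to its cluster mean."""
    total = 0.0
    for points in clusters:
        m = centroid(points)
        for p in points:
            total += sum((x - y) ** 2 for x, y in zip(p, m))
    return total


# Hypothetical clusters: the first has mean (1, 0), the second is a single point.
clusters = [[(0.0, 0.0), (2.0, 0.0)], [(5.0, 5.0)]]
E = squared_error(clusters)
```

K-means alternates between assigning objects to the nearest mean and recomputing the means, and E never increases from one iteration to the next; a smaller E indicates more compact clusters.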
No.   Ant clustering algorithm   K-means algorithm      Hybrid clustering algorithm
      time      accuracy         time      accuracy     time      accuracy
1     7.3s      56               0.018s    83           6.5s      88
2     7.1s      58               0.017s    75           6.6s      79
3     7.2s      55               0.018s    77           6.5s      85
ACKNOWLEDGMENT
The authors are grateful to Professor ZuoDai of WHUT
for his remarks and good suggestions.