Genetic Clustering Algorithms
a Aviation and Maritime Management Department, Chang Jung Christian University, Tainan 711, Taiwan, ROC
b Institute of Traffic and Transportation, National Chiao Tung University, 4F, 114 Sec. 1, Chung-Hsiao W. Rd., Taipei 10012, Taiwan, ROC
Abstract
This study employs genetic algorithms to solve clustering problems. Three models, SICM, STCM and CSPM, are developed according to different coding/decoding techniques. The effectiveness and efficiency of these models under varying problem sizes are analyzed in comparison to a conventional statistics clustering method (the agglomerative hierarchical clustering method). The results for small-scale problems (10–50 objects) indicate that CSPM is the most effective but least efficient method, STCM is second most effective and efficient, and SICM is least effective because of its long chromosome. The results for medium-to-large-scale problems (50–200 objects) indicate that CSPM is still the most effective method. Furthermore, we have applied CSPM to solve an exemplified p-median problem. The good results demonstrate that CSPM is usefully applicable. © 2001 Elsevier Science B.V. All rights reserved.
Y.-C. Chiou, L.W. Lan / European Journal of Operational Research 135 (2001) 413–427
conventional statistics clustering, mathematical programming, network programming, and genetic algorithms (GAs). The algorithms for conventional statistic clustering [3,14,18,25,33] include the agglomerative hierarchical clustering method and K-means. The algorithms for mathematical programming [8,11,17,27,28,30–32] range from dynamic programming, Lagrangian relaxation, linear relaxation, column generation and branch-and-price to Lipschitz continuous methods. The algorithms for network programming [2,12] include graph-theoretic relaxations and network relaxation. The algorithms for GAs have been rapidly developed recently [4,6,10,19,20,22–24,26], including the group-numbers encoding method (e.g. binary code, Boolean matching code), the group-separators encoding method and the evolution program method.

While the aforementioned studies have proposed ways to solve clustering problems, two main research gaps still remain. First, the number of […]

[…] mathematical functions without a need for additional information in the search. This study attempts to develop coding/decoding techniques for GAs to solve simultaneously for the optimal number of clusters and the optimal clustering result, in comparison with the conventional statistic clustering method (the agglomerative hierarchical clustering method).

2. Mathematical model of clustering

The mathematical model of clustering for a given number (m) of clusters is

[CA_m]    Max F(X)    (1)

subject to

    Σ_j X_ij = 1,    all i,    (2)
    Σ_j X_jj = m,    (3)
cluster with other clusters. ‖·‖_p represents the l_p-norm. {·} represents the measure of a vector of diameter or distance. O_k, O_l are the kth and lth objects, respectively.

The total number of feasible solutions of [CA_m] is |π_m| = (1/m!) Σ_{j=0}^{m} (−1)^j C(m, j)(m − j)^N [1]. There are a total of 511 feasible solutions for 10 objects (N = 10) to be divided into 2 clusters (m = 2). There are a total of 42,525 feasible solutions for 10 objects to be divided into 5 clusters.

If the number of clusters (m) is not given exogenously, then [CA_m], without the constraint (3), becomes [CA], that is:

[CA]    Max F(X)    (9)

subject to

    Σ_j X_ij = 1,    all i,    (10)
    X_ij ≤ X_jj,    all i, j,    (11)
    X_ij ∈ {0, 1},    all i, j.    (12)

As to [CA], the number of feasible solutions is |π| = Σ_{m=1}^{N} |π_m|. There are 52 feasible solutions at N = 5, 113,608 at N = 10 and 1.99 × 10^7 at N = 15. Obviously, the complexity of [CA] is exponential in the problem size.

3. Genetic algorithms

Genetic algorithms are general-purpose search algorithms that use principles inspired by natural population genetics to evolve solutions to problems. The basic idea of genetic algorithms is to maintain a population of chromosomes that represent candidate solutions. A chromosome is composed of a series of genes that represent decision variables or parameters. Each member of the population is evaluated and assigned a measure of its fitness as a solution. There are three genetic operators: selection, crossover and mutation [13].

The first operator – selection – is to assign reproduction possibilities to chromosomes based on their fitness. A Monte Carlo wheel is often employed. That is, the higher the fitness of a chromosome, the more likely it is to be selected.

The second operator – crossover – is to combine the features of two parent structures to form two offspring. The simplest way to make a crossover is to swap a corresponding segment of the parents. One-point crossover, two-point crossover and uniform crossover are often employed.

The last operator – mutation – is to alter one or more genes of the offspring with a very low probability to avoid being trapped in a local optimum. The resulting offspring is then evaluated and inserted back into the population. This process continues until a predetermined criterion (e.g. maximum number of generations, minimum improvement of fitness between two adjacent generations or a certain mature rate) is reached.

4. Problem formulation

The effectiveness and efficiency of GAs vary with various coding/decoding techniques. This study proposes three coding/decoding techniques for GAs to solve clustering problems. They are the simultaneously clustering method (SICM), the stepwise clustering method (STCM) and the cluster seed points method (CSPM). Then, these models are compared with a conventional statistics clustering model – the agglomerative hierarchical clustering method (AHCM). The details of these four models are described as follows.

4.1. Agglomerative hierarchical clustering method (AHCM)

AHCM involves a series of successive merges. Initially, there are as many clusters as objects. These initial groups are merged according to their degree of improvement in the objective values. Eventually, all subgroups are fused into a single cluster [33]. The following are the steps in AHCM for grouping N objects in a maximizing problem, for example:

Step 0. Start with N clusters, each containing a single object, that is, S_i^1 = {O_i}, i = 1, …, N. An […]
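The partition counts quoted in Section 2 can be checked numerically. A minimal sketch (written for this note, not from the paper) implementing |π_m| = (1/m!) Σ_{j=0}^{m} (−1)^j C(m, j)(m − j)^N and |π| = Σ_m |π_m|:

```python
from math import comb, factorial

def num_partitions(n, m):
    """|pi_m|: ways to split n objects into exactly m non-empty clusters
    (the Stirling number of the second kind)."""
    return sum((-1) ** j * comb(m, j) * (m - j) ** n
               for j in range(m + 1)) // factorial(m)

def total_partitions(n):
    """|pi| = sum of |pi_m| over m = 1..n, the count for model [CA]."""
    return sum(num_partitions(n, m) for m in range(1, n + 1))

print(num_partitions(10, 2))  # 511 feasible solutions (N = 10, m = 2)
print(num_partitions(10, 5))  # 42525 (N = 10, m = 5)
print(total_partitions(5))    # 52 (N = 5, m free)
```

The three printed values agree with the counts stated in the text.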
Table 3
Relationship between objects and clusters (17 objects for example)

Object     Cluster (1 2 3 4 5 6 7 8)   Encoding
 1         1 0 0 0 0 0 0 0            000
 2         0 1 0 0 0 0 0 0            001
 3         0 0 0 1 0 0 0 0            011
 4         1 0 0 0 0 0 0 0            000
 5         0 0 0 0 0 1 0 0            101
 6         0 0 0 0 0 1 0 0            101
 7         0 0 1 0 0 0 0 0            010
 8         1 0 0 0 0 0 0 0            000
 9         0 1 0 0 0 0 0 0            001
10         1 0 0 0 0 0 0 0            000
11         1 0 0 0 0 0 0 0            000
12         0 0 0 1 0 0 0 0            011
13         0 0 0 1 0 0 0 0            011
14         1 0 0 0 0 0 0 0            000
15         1 0 0 0 0 0 0 0            000
16         0 0 0 0 0 1 0 0            101
17         0 0 1 0 0 0 0 0            010
Subtotal   7 2 2 3 0 3 0 0
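The encoding column of Table 3 concatenates one fixed-width binary gene per object, so decoding is just reading each 3-bit gene as a cluster number. A minimal sketch (the helper function is illustrative, not from the paper):

```python
def decode(chromosome, n_objects, bits=3):
    """Map a SICM-style bit string to cluster memberships:
    gene '000' -> cluster 1, '001' -> cluster 2, ..., '101' -> cluster 6."""
    clusters = {}
    for i in range(n_objects):
        gene = chromosome[i * bits:(i + 1) * bits]
        clusters.setdefault(int(gene, 2) + 1, []).append(i + 1)
    return clusters

# The 17 genes of Table 3, in object order.
genes = ["000", "001", "011", "000", "101", "101", "010", "000", "001",
         "000", "000", "011", "011", "000", "000", "101", "010"]
result = decode("".join(genes), 17)
print({k: len(v) for k, v in sorted(result.items())})
# {1: 7, 2: 2, 3: 2, 4: 3, 6: 3}
```

The per-cluster counts recovered this way match the rows of Table 3.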
branch-and-bound. When all clusters are fathomed, STCM has attained the optimal clustering. The framework of the model is depicted by Fig. 1. The following are the algorithms for STCM under the depth-first principle, which fathoms individual branches one at a time.
Step 0. Let S^0 stand for the cluster containing all objects. The problem of optimally dividing S^0 into two subgroups, namely S^0' and S^0'', can be formulated as the following 0–1 mathematical programming:

[MP^0]    Max F^0(X)    (13)

[…] chromosome can be largely curtailed and can be further reduced in the evolutions of the optimization stages. Consider N objects for instance: let |S^0| = N denote N objects in set S^0; the length of the chromosome at stage 0 is N. If |S^0'| = L0, the length of the chromosome at stage 1 is N − L0. If |S^1'| = L1, the length of the chromosome at stage 2 can be further shortened to N − L0 − L1, and so forth.
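The shrinking chromosome length across STCM stages can be tabulated directly; a minimal sketch (the stage sizes used in the example are hypothetical):

```python
def chromosome_lengths(n_objects, closed_cluster_sizes):
    """Chromosome length at each STCM stage: objects already fixed into a
    closed cluster drop out of the encoding at the next stage."""
    lengths = [n_objects]
    remaining = n_objects
    for size in closed_cluster_sizes:
        remaining -= size
        lengths.append(remaining)
    return lengths

# Hypothetical example: N = 17 objects; stage 0 closes a cluster of
# L0 = 7 objects, stage 1 closes a cluster of L1 = 4 objects.
print(chromosome_lengths(17, [7, 4]))  # [17, 10, 6]
```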
5. Computational experiments
Table 4
Effectiveness of the four methods (N ≦ 50)^a

Number of   AHCM     SICM                     STCM                     CSPM
objects     F        F1      dF1              F2      dF2              F3      dF3
                     (F1/F)  (Z1)             (F2/F)  (Z2)             (F3/F)  (Z3)
10          13.06    12.45   0.73             13.06   0.00             13.06   0.00
                     (0.95)  (-4.63*)         (1.00)  (0.00)           (1.00)  (0.00)
20          26.95    11.38   4.49             26.77   0.69             26.87   0.36
                     (0.42)  (-18.97*)        (0.99)  (-1.43)          (1.00)  (-1.28)
30          28.70    13.83   3.83             28.82   2.81             36.40   1.28
                     (0.48)  (-21.25*)        (1.00)  (0.22)           (1.27)  (32.90*)
40          37.10    11.55   4.44             38.72   2.88             48.99   1.75
                     (0.31)  (-31.55*)        (1.04)  (3.08*)          (1.32)  (37.20*)
50          38.24    14.81   4.50             51.18   4.47             61.01   1.81
                     (0.39)  (-28.53*)        (1.34)  (15.86*)         (1.60)  (68.98*)

^a (1) F stands for the objective value of AHCM. (2) Fi and dFi represent the mean and standard deviation of the objective values solved by SICM, STCM and CSPM with 30 different executions, i = 1, 2, 3. (3) The effectiveness index of the ith method is defined as Fi/F. (4) Zi = (Fi − F)/(dFi/√30), which follows the standard Normal distribution N(0, 1). (5) * denotes that the null hypothesis (H0: Fi = F) is rejected at the 5% level of significance.
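The Z statistic in note (4) can be recomputed from any table row; a minimal sketch. Using the rounded SICM entries at N = 10 gives about -4.58, close to the printed -4.63 (the small difference comes from rounding of the reported mean and standard deviation):

```python
from math import sqrt

def z_stat(mean_i, benchmark, sd_i, n=30):
    """Z_i = (mean_i - benchmark) / (sd_i / sqrt(n)); n = 30 executions."""
    return (mean_i - benchmark) / (sd_i / sqrt(n))

# SICM at N = 10 in Table 4: mean 12.45, sd 0.73, AHCM value F = 13.06.
z = z_stat(12.45, 13.06, 0.73)
print(round(z, 2))  # -4.58
```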
Table 5
Efficiencies of the four methods (N ≦ 50)^a

Number of   AHCM     SICM                     STCM                     CSPM
objects     SS       SS1      dSS1            SS2      dSS2            SS3        dSS3
                     (SS/SS1) (Z1)            (SS/SS2) (Z2)            (SS/SS3)   (Z3)
10          172      1987     1193            2917     418             11,372     1724
                     (0.09)   (9.06)          (0.06)   (36.08)         (0.02)     (36.08)
20          1347     7383     2506            7533     1020            133,743    42,927
                     (0.18)   (16.08)         (0.18)   (17.06)         (0.01)     (17.06)
30          4521     12,963   3538            14,323   2018            511,915    220,156
                     (0.35)   (20.02)         (0.32)   (12.74)         (0.01)     (12.74)
40          10,696   33,020   10,250          20,970   1776            1,257,755  454,935
                     (0.32)   (17.62)         (0.51)   (15.14)         (0.01)     (15.14)
50          20,871   68,033   17,141          28,080   2143            2,226,590  633,469
                     (0.31)   (21.73)         (0.74)   (19.25)         (0.01)     (19.25)

^a (1) SS stands for the number of solutions searched by AHCM. (2) SSi and dSSi represent the mean and standard deviation of the number of solutions searched by SICM, STCM and CSPM with 30 different executions, i = 1, 2, 3. (3) The efficiency index of the ith method is defined as SS/SSi. (4) Zi = (SSi − SS)/(dSSi/√30), which follows the standard Normal distribution N(0, 1). (5) * denotes that the null hypothesis (H0: SSi = SS) is rejected at the 5% level of significance.
Table 5 summarizes the number of solutions searched until the "optimal solution" is obtained by the four methods under various problem sizes. Obviously, AHCM, which has the least number of solutions searched, is the most efficient method. Further comparison between SICM and STCM is made by a two-tailed test of H0: SS1 = SS2 against H1: SS1 ≠ SS2. The Z values at N = 10, 20, 30, 40 and 50 are 4.03, 0.30, 1.83, -6.34 and -12.67, respectively, implicitly showing that SICM is more efficient than STCM at N = 10, not significantly different from STCM at N = 20 and 30, and less efficient than STCM at N = 40 and 50. Similarly, the results of two-tailed tests of H0: SS1 = SS3 against H1: SS1 ≠ SS3 and H0: SS2 = SS3 against H1: SS2 ≠ SS3 have shown that CSPM is the least efficient method.

Fig. 5 shows an optimal clustering result of CSPM at N = 50. The figure shows that, by maximizing the ratio of between-clusters variability to within-cluster variability, these 50 objects have been divided into 11 clusters. Each cluster contains 2–8 objects. Since the objects in the same clusters are adjacent to each other and no object is obviously grouped into a wrong cluster, the result of clustering appears to be good.
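The criterion described above, the ratio of between-clusters to within-cluster variability, can be illustrated on a one-dimensional toy example (the paper's exact objective F may differ in detail; this SSB/SSW form is a common instance):

```python
def between_within_ratio(clusters):
    """Ratio of between-clusters sum of squares (SSB) to within-cluster
    sum of squares (SSW) for a list of 1-D clusters."""
    all_points = [x for c in clusters for x in c]
    grand = sum(all_points) / len(all_points)
    ssb = sum(len(c) * (sum(c) / len(c) - grand) ** 2 for c in clusters)
    ssw = sum((x - sum(c) / len(c)) ** 2 for c in clusters for x in c)
    return ssb / ssw

# Two tight, well-separated clusters give a large ratio.
print(between_within_ratio([[1, 2, 3], [7, 8, 9]]))  # 13.5
```

A partition that splits tight groups, or merges distant ones, lowers this ratio, which is why maximizing it yields clusters whose members are adjacent to each other.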
6. Applications
Fig. 6. Means and 95% upper/lower confidence intervals of STCM and CSPM (50 ≦ N ≦ 200).
subject to

    Σ_j X_ij = 1,    all i,    (17)
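For intuition, a tiny p-median instance can be solved by exhaustive enumeration; the sketch below (hypothetical one-dimensional data, not the paper's exemplified instance) picks the p facility sites that minimize the total distance from each object to its nearest facility:

```python
from itertools import combinations

def p_median(points, p):
    """Brute-force p-median: try every choice of p sites among the points
    and keep the one with minimum total nearest-facility distance."""
    best_cost, best_sites = float("inf"), None
    for sites in combinations(points, p):
        cost = sum(min(abs(x - s) for s in sites) for x in points)
        if cost < best_cost:
            best_cost, best_sites = cost, sites
    return best_cost, best_sites

cost, sites = p_median([0, 1, 10, 11], 2)
print(cost)  # 2
```

Enumeration is only viable for tiny instances; the combinatorial growth is exactly why the paper turns to CSPM for larger problems.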
Fig. 15. Optimal number of facilities and total distance costs vs. facility set-up cost.
[31] M. Savelsbergh, A branch-and-price algorithm for the generalized assignment problem, Operations Research 45 (1997) 831–841.
[32] M.A. Trick, A linear relaxation heuristic for the generalized assignment problem, Naval Research Logistics 39 (1992) 137–152.
[33] J.H. Ward, Hierarchical grouping to optimize an objective function, Journal of the American Statistical Association 58 (1963) 236–244.
[34] J.W. Welch, Algorithmic complexity: Three NP-hard problems in computational statistics, Journal of Statistical Computation and Simulation 15 (1983) 17–25.