A Discrete Moth-Flame Optimization With An L 2-Norm Constraint For Network Clustering
A Discrete Moth-Flame Optimization With An L 2-Norm Constraint For Network Clustering
3, MAY/JUNE 2022
Abstract—Complex network clustering problems have been and mathematics, complex networks are often represented as a
gained great popularity and widespread researches recently, and graph where vertices and edges denote the objects and the rela-
plentiful optimization algorithms are aimed at this problem. Among
these methods, the optimization methods aiming at multiple tions between them in complex systems, respectively. The clus-
objectives can break the limitations (e.g., instability) of those ter structures are ubiquitous in networks, such as social clusters
optimizing single objective. However, one shortcoming stands out based on different backgrounds or interests in social net-
that these methods cannot balance the exploration and exploitation works [5], and subway stations cluster based on different passen-
well. In another sentence, it fails to optimize solutions on the basis of ger flows in subway networks [6]. Here, a cluster (or
the good solutions obtained so far. Inspired by nature, a new
optimized method, named multi-objective discrete moth-flame community) is defined as a collection of closely-connected sub-
optimization (DMFO) method is proposed to achieve such a set vertices, with sparse connections between different clus-
tradeoff. Specifically, we redefine the simple flame generation (SFG) ters [7]. The aim of network clustering is to detect cluster
and the spiral flight search (SFS) processes with network topology structures in a network, namely, to divide a network into many
structure to balance exploration and exploitation. Moreover, we subsets of vertices according to the properties of vertices.
present the DMFO in detail utilizing a Tchebycheff decomposition
method with an l2 -norm constraint on the direction vector (2-Tch). Network clustering can be essentially modeled as an optimi-
Besides that, experiments are taken on both synthetic and real- zation problem [8]. Generally, such a clustering optimization
world networks and the results demonstrate the high efficiency and problem is deemed to be NP-hard, and several optimization
promises of our DMFO when tackling dividing complex networks. algorithms have been proposed for solving this problem
Index Terms—Network clustering, multi-objective optimiza- recently [9]. Traditional single-objective methods tend to select
tion, discrete moth-flame optimization, decomposition. an optimal objective at first (such as the modularity Q [10], [11],
the modularity density [12] or the community score [13]) and
optimize such an objective by leveraging different strategies. A
I. INTRODUCTION quintessential example is that GA-Net optimizes the community
score by means of a genetic algorithm for network clustering in
C OMPLEX networks are omnipresent in reality, such as
biological networks [1], social networks [2], transportation
networks [3] and biochemical networks [4]. In computer science
social networks [13]. DCRO optimizes the modularity in a way
the chemical reaction process takes place for the acquisition of a
better clustering result [14]. Nevertheless, these methods share a
common shortcoming that they only take structure into account
Manuscript received March 7, 2021; revised September 2, 2021; accepted
February 17, 2022. Date of publication February 23, 2022; date of current ver- when formulating objective functions, even when the multiple
sion May 23, 2022. This work was supported in part by the National Science underlying structural attributes ask for solutions. Therefore, in
Fund for Distinguished Young Scholars under Grant 62025602, in part by the order to overcome such a shortcoming, the multi-objective meth-
National Natural Science Foundation of China under Grants 61976181,
U1803263, and 11931015, in part by Fok Ying-Tong Education Foundation under ods mining multiple characteristics of a network simultaneously
Grant 171105, in part by the Fundamental Research Funds for the Central Univer- have been applied to make network clustering more accurate and
sities under Grants D5000210827 and D5000210738, in part by the Natural Sci-
ence Foundation of Shaanxi Province under Grant 2022JM-325, and in part by
efficient [15]–[20]. For example, a multi-objective network clus-
Tencent Foundation and XPLORER PRIZE. Recommended for acceptance tering method (MOGA-Net) leverages the non-dominated sorting
by Dr. Gaoxi Xiao. (Corresponding authors: Zhen Wang; Chao Gao) genetic algorithm-II (NSGA-II) to improve both the community
Xianghua Li and Chao Gao are with the School of Artificial Intelligence,
Optics, and Electronics (iOPEN), Northwestern Polytechnical University,
score and community fitness simultaneously [20]. Although a
Xian 710072, China, and also with the College of Computer and Information discrete multi-objective particle swarm optimization (MODPSO)
Science & College of Software, Southwest University, Chongqing 400715, China method considers the kernel K-means (KKM) and Ratio
(e-mail: [email protected]; [email protected]).
Xin Qi, Xingjian Liu, and Fan Zhang are with the College of Computer and Cut (RC) to further improve the accuracy of the division
Information Science & College of Software, Southwest University, Chongqing results [15], [21], there is no clear accurate timing for transition
400715, China (e-mail: [email protected]; 2638666276@foxmail. between exploration and development. Therefore it is difficult
com; [email protected]).
Zhen Wang is with the School of Artificial Intelligence, Optics, and Elec- for multi-objective optimization algorithms to achieve a tradeoff
tronics (iOPEN), Northwestern Polytechnical University, Xian 710072, China between exploration and development in the calculation process.
(e-mail: [email protected]). In recent years, more and more meta-heuristic algorithms have
Jiming Liu is with the Department of Computer Science, Hong Kong Bap-
tist University, Hong Kong (e-mail: [email protected]). been applied to solve multi-objective optimization problems. For
Digital Object Identifier 10.1109/TNSE.2022.3153095 example, the monarch butterfly optimization (MBO) is a very
2327-4697 © 2022 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See ht_tps://www.ieee.org/publications/rights/index.html for more information.
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1777
competitive metaheuristic algorithm, which is very easy to imple- optimal value. More specifically, based on the label-based repre-
ment because it only needs to fine-tune migration and adjust oper- sentation of individuals, we adjust the SFG and SFS processes to
ators [22]. Inspired by the cooperative behaviors and chasing extend a discrete MFO variant. On the basis of two optimization
styles of Harris hawks, a novel population based optimization functions which contradicts each other (i.e., KKM and RC), the
algorithm, called Harris hawks optimization (HHO), is proposed multi-objective optimization network clustering framework gets
to tackle different optimization tasks [23]. A new stochastic opti- optimized. The contributions of the paper are as follows:
mizer, which is called slime mould algorithm (SMA), is designed For the balance between exploration and exploitation
based on the oscillation mode of slime mould [24]. The SMA when clustering networks, the discrete MFO variant
algorithm has several new features with a unique mathematical (denoted as DMFO) is extended for the first time. Such
model to form the optimal path for connecting foods with the a variant redefines the SFG and SFS processes based on
excellent exploratory ability and exploitation propensity. More- the features of networks.
over, A Hunger Games Search (HGS) is designed according to To divide and conquer the network clustering problems,
the hunger-driven activities and behavioral choice of ani- the improved Tchebycheff method (2-Tch) is leveraged
mals [25]. The HGS algorithm incorporates the concept of hunger in our DMFO, which is first used to address network
into the feature process, in other words, an adaptive weight based clustering.
on the concept of hunger is designed and employed to simulate Extensive comparative experiments on both synthetic
the effect of hunger on each search step. Furthermore, a general- and real-world networks validate the good performance
purpose metaheuristic method named moth search algorithm of our proposed DMFO.
(MSA) is proposed to generalize the phototaxis and Levy flights The rest of this paper is designed as follows. The background is
of moths [26]. The features mentioned above correspond to the given in Section II. Our approach for network clustering is dem-
exploitative and explorative process of an optimization algo- onstrated in Section III. Then, extensive experiments are imple-
rithm. The above-mentioned multi-objective optimization algo- mented to prove the superiority of the proposed DMFO algorithm
rithms have proved their superiority in detecting communities in in Section IV. Finally, conclusions are made in Section V.
complex networks. However, due to the disaster of dimensional-
ity, when these algorithms deal with complex optimization II. RELATED WORKS
problems with a large number of local optimal solutions, the con-
A. Problem Formulation
vergence speed is slow and there are easy to fall into the local
optimal. How to balance the exploration ability and the exploita- An undirected graph G ¼ ðV; EÞ is usually utilized to repre-
tion ability to obtain an ideal optimization result is still a big prob- sent a network, where V and E stand for a set of vertices and
lem that needs to be solved urgently. edges, correspondingly. A network cluster is defined as a
The moth-flame optimization (MFO) algorithm is a new sto- group of vertices, where the number of edges inside the same
chastic optimization algorithm [27]. With the simple flame gen- cluster is higher than that of edges between different clus-
eration (SFG) and the spiral flight search (SFS) processes [28] ters [7]. The purpose of network clustering is to divide the net-
merged into it, the MFO can get a good balance between explo- work into closely connected subgraphs Gi ði ¼ 1; . . . ; kÞ in G,
ration and exploitation. Considering these features, Xu et al. where G obeys:
[29] firstly propose a series of mutation-based MFO, such as
LGCMFO and GMFO. These MFO variants use mutation meth- [1ik Gi ¼ G and \1ik Gi ¼ ? (1)
ods such as Gaussian mutation (GM), Cauchy mutation (CM),
Levy mutation (LM) to balance the exploration and exploitation
In reality, network clustering needs to comprehensively
processes in order to improve the performance of the basic
consider the attributes between intra-community and inter-
MFO. Moreover, a double adaptive weight mechanism is intro-
community, which means that the detection process should
duced into the MFO algorithm [30], termed as WEMFO, which
consider the multiple optimization metrics. Currently, the
adjusts the search strategy adaptively in different periods of
widely used optimization metrics are RC and KKM [15],
algorithm, to make it more flexible between global search
which are a pair of conflicting objective functions, corre-
(diversification) and local search (intensification). Furthermore,
sponding to the sparse connection of inter-communities and
the CLSGMFO algorithm introduces two mechanisms, Gaussian
the close connection of intra-communities, respectively. In
mutation and chaotic local search, into the original MFO algo-
order to get an excellent clustering effect reflected by such a
rithm [31]. More specifically, Gaussian mutation operation
pair of metrics, the multi-objective optimization process is
increases the diversity of the moth population during the explo-
defined in (2). More specifically, there are N vertices and jEj
ration phase and the chaotic local search mechanism provides
edges in G, the adjacency matrix of G is A ¼ ðAij ÞNN . If
the original MFO with the ability to jump out of local optima.
there exists an edge between vertices vi and vj in a network,
However, these methods don’t address the impact of premature
Aij ¼1, otherwise, 0.
convergence on the network clustering.
In this paper, we propose a novel multiobjective discrete moth- 8 P
< f1 ¼ KKM ¼ 2ðN cÞ ci¼1 LðVi ;Vi Þ
fame optimization algorithm (DMFO) for processing network jVi j
min
: f2 ¼ RC ¼ P c L ð V ;V Þ (2)
clustering. By continuously adjusting the positions of the flames i i
i¼1 jVi j
and moths, our method can accurately approximate the global
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1778 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
where c represents
P the number of cluster in a network. the convergence speed and computational performance of the
LðVi ; Vi Þ ¼ i2Vi ;j2Vi Aij means the internal link densities of network clustering algorithm.
nodes both coming from the same community i. In contrast, Moreover, some approaches, such as MOEA/D-Net [37],
P
LðVi ; V i Þ ¼ i2Vi ;j2V i Aij represents the external link densities APMOEA [38] and MODBSA/D [39], are proposed based on
of vertices. Specifically, Vi denote a set of nodes coming from the decomposition to explore solution spaces by minimizing two
the ith community while V i stands for a set of nodes which contradictory optimization objectives, i.e., the ratio cut (RC) and
belongs to any other communities except the ith community. negative ration association (NRA). In addition, two contradic-
tory optimization functions, i.e., Kernel K-Means (KKM) and
Ratio Cut (RC), are proposed to develop the discrete multi-objec-
B. Nature-Inspired Computing
tive particle swarm optimization (MODPSO) [15] algorithm for
Network clustering is a key field of complex network dividing networks. This set of objective functions has gained
research, which have been widely studied in recent years. great popularity as soon as proposed. For instance, a quantum-
Numerous nature-inspired optimization algorithms are aimed mechanism-based PSO algorithm with non-dominated sorting
at solving this problem [32]–[34]. These solutions roughly selection mechanism [16], a multi-objective particle swarm opti-
split into two types, i.e., single-objective optimization and mization algorithm (MOPSO-Net) [17], and a network reduc-
multi-objective optimization approaches. tion-based multi-objective evolutionary algorithm (RMOEA) [9],
Concerning the first category of single objective optimization, which all use these two objective functions to achieve the better
only one objective function is selected to be optimized. For solution of network clustering.
instance, Pizzuti developed a genetic algorithm (GA-Net) for However, the above-mentioned optimization algorithms,
network clustering in social networks [13] by optimizing a sim- whether single-objective or multi-objective, are easy to fall
ple and effective fitness function. Chang et al. developed a into the local optimum and it is difficult to achieve the balance
DCRO [14] method, which simulates the chemical reaction of exploration and development in the process of searching for
process, thereby optimizing the modularity to achieve network solutions, which leads to the loss of optimal solutions. The
clustering. A discrete fireworks algorithm (DMFWA) [12] is Moth Flame Optimization (MFO) algorithm is an intelligent
proposed to solve network clustering problems by maximizing optimization algorithm based on population. Its particles move
the modularity density function. in a spiral way around a better solution, which not only ensures
As for the second category about the multi-objective opti- that algorithm is unlikely to fall into the local optimum, but
mization algorithms, they optimize multiple objective func- also enhances the parallel optimization ability of algorithm and
tions simultaneously for network clustering, which can the balance ability of exploration in order to obtain better clus-
compensate for the shortcomings of single-objective optimi- tering solution. Currently, the MFO algorithm has achieved
zation algorithms. In the last few years, many efforts have excellent results in many practical engineering optimization
been devoted to the application of evolutionary algorithms problems [40]–[42].
(EAs) to the development of multi-objective optimization
algorithms. For example, based on the non-dominated sorting C. The Moth-Flame Optimization
genetic algorithm-II (NSGA-II) [20], a variety of multi-
objective network clustering methods have been developed. 1) Simple Flame Generation Process: Inspired by the nat-
Among them, the MOGA-Net algorithm maximizes intra- ural moth navigation mechanism, Mirjalili proposed a novel
connections and minimizes inter-connections simultaneously population-based stochastic optimization algorithm called the
based on the community score and community fitness (e.g., Moth-Flame Optimization (MFO) algorithm [27]. The MFO
two contradictory objective functions) [20], respectively. algorithm contains two populations, namely the moth popula-
Unlike only considering the structure of a network, MOGA- tion and the flame population. The entire space where the moth
@Net algorithm integrates the structure and composition flies is the solution space of the problem. Specifically, each
dimensions contained in the attribute network, and can obtain moth corresponds to a light source, which avoids the algorithm
a division result that is both well-connected and similar falling into the local optimum. When there are enough moths
nodes [35]. Moreover, Li et al. proposes a multi-agent and flames, the flight of moths can search most of the solution
genetic algorithm for community detection by designing a set space, thus ensuring the exploration ability of algorithm. In the
of effective neighborhood-based operators [36]. It uses the process of optimization, the number of flames decreases as the
matrix coding and node similarity to generate the initial pop- round of iterations increases, so that the moth can fully search
ulation, which enhances the diversity of initial individuals the neighborhood space of better solutions, and the utilization
and improves the efficiency of finding the optimal solution. ability of the algorithm is ensured.
In summary, these algorithms achieve superior results in net- At the same time, the MFO algorithm contains two key pro-
work clustering problems, however, due to the curse of cesses: Simple Flame Generation (SFG) and Spiral Flight
dimensionality, they converge slowly when dealing with Search (SFS). Among them, the purpose of SFG process is to
complex optimization problems with a large number of local generate the next generation of flames from the best set of moth
optimal solutions. Meanwhile, they do not address the prob- and flame populations obtained so far. Then, the SFS process
lem of balancing exploration and development in the process imitates the horizontal positioning behavior of moth, guiding
of searching for solutions, which is significantly in improving the moth to fly to the flame in a spiral shape, thereby updating
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1779
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1780 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1781
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1782 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
Algorithm 3: The DMFO framework. First, the initialization phase is used to initialize moths and
Input: The adjacency matrix A of a network, max generation flames, which can be completed in the linear time OðjEjÞ.
Max iter, mutation probability: pm, neighborhood size: Then, the redesigned SFG process aims to select the flame,
niche, moths or flames size: popsize. which mainly uses the two-way crossover and NBM techni-
Output: A best cluster divisions C ¼ fCi gci¼1 of a given network. ques. Here, the time complexity of such two techniques are
1 Initialize Mð0Þ; both OðNÞ. In addition, the SFG takes OðN popsize
2 Generate a uniformly distributed weight vector w ¼ ðw1 ; . . . ; logðpopsizeÞÞ to select the optimal flames based on the modu-
wpopsize Þ; larity of descending order. Namely, the time complexity of the
3 foreach each moth and flame do SFG process is OðNþN popsize logðpopsizeÞÞ, where
4 Initialize the neighborhood based on the weight vectors; popsize is size of moths or flames. Moreover, the redesigned
5 Initialize reference point z .
SFS process updates moths by redesigning the moth flight
6 while k 2 ½1; Max iter do
strategy for specific problems. That is, the time complexity of
7 F ðkÞ execute Alg:1 : SFG process;
8 MðkÞ execute Alg:2 : SFS process; SFS process is OðD popsizeþNÞ. Finally, DMFO uses the
9 if randð0; 1Þ < pm then NBM operation again for the local search, which takes OðNÞ
10 Mi ðkÞ the NBM operator; time. Therefore, the overall time complexity of our proposed
11 Evaluate all moths MðkÞ based on (15); DMFO is OðMax iterðjEjþN popsize logðpopsizeÞÞ, whe-
12 if Mi ðkÞ is better than Mi ðk 1Þ then re Max iter is the max iteration number.
13 Mi ðkÞ Mi ðkÞ;
14 else
15 Mi ðkÞ Mi ðk 1Þ; IV. EXPERIMENTS
16 Update reference point z ; Aiming to test the behavior of our DMFO method, the exper-
17 return C ¼ fCi gci¼1 ; imental results and analyses are presented in this section.
Section IV-A introduces the experimental settings. Section IV-
B compares our proposed method with some state-of-the-art
MOEA/D, MOEA/D-2TCHMFI proves that choosing p = 2 approaches in both synthetic and real-world networks.
can improve the uniformity of the subproblem update regions, Section IV-C shows the contribution of an optimized Tch-
and the subproblem objective function in 2-Tch has a clearer ebycheff decomposition approach for network clustering.
geometric property in terms of Euclidean distance than that of Section IV-D focuses on the influence of the newly designed
using larger p values [47]. Therefore, the Tchebycheff decom- procedures, i.e., Algorithm 1 and Algorithm 2.
position using l2 -norm form constraint is an excellent decom-
position method. Based on the above decomposition method,
A. Experimental Design
the multi-objective optimization problem is transformed into a
single-objective sub-problem. This paper solves the uneven 1) Datasets and Parameter Setting: The extended Girvan-
distribution of the weight vector of the sub-problem in the pre- Newman (GN) benchmark networks [10] and real-world net-
vious decomposition algorithm that improves the overall works are selected to test the performance of our proposed
application performance of all sub-problem. Finally, we can DMFO method for network clustering.
achieve a better balance about the convergence and diversity As listed in Table I, there are seven real-world networks,
of the algorithm solutions. The definition of the 2-Tch method including the Zachary’s Karate Club network (G1 ) [48], the
is given in (14). Bottlenose Dolphin network (G2 ) [49], the American College
Football Network (G3 ) [10], the Books about US Politics net-
work (G4 ) [50], the Santa Fe Institute (SFI) network (G5 ) [51],
fi ðxÞ zi
min gptch ðFðxÞjw; z Þ ¼ max (14) and the Email network (G6 ) [52], and the Power grid network
x2V 1iq wi
(G7 ) [48]. In the proposed DMFO, all datasets cannot iterate
over 80 times, and the population scale is customized by the
where w ¼ ðw1 ; . . . ; wq Þ with kwk2 ¼ 1 and w1 ; . . . ; wq 0.
dataset size. In order to save computational cost in large data-
z ¼ ðz1 ; . . . ; zq Þ denotes the reference point (i.e., zi ¼ min
sets, the size of population for G1 and G2 is 150, and 100 for
fi ðxjx 2 V; i ¼ 1; . . . ; q) in which q represents the number of
other datasets. Besides that, DMFO updates moths mainly by
objective functions. flames, but it often fails to avoid the local optimization. In
view of this, the mutation rate and crossover rate are pro-
E. Computation Complexity Analysis
moted, i.e., pm ¼ 0:3 and pc ¼ 0:9, on the basis of their
The framework of our proposed DMFO method is presented corresponding references. Moreover, to reduce the statisti-
in Algorithm 3. Moreover, Fig. 4 illustrates the overall flow of cal error, all experiments are the averaged results over 10
DMFO method. As shown in Algorithm 3 and Fig. 4, the main running times.
loop of the overall DMFO is mainly composed of two rede- 2) Evaluation Metrics: In order to measure the perfor-
signed processes (i.e., the SFG and SFS processes). Given a mance of an algorithm, the modularity Q [10] and the normal-
network G with N vertices and jEj edges, the computational ized mutual information(NMI) [53] are usually used as
complexity of DMFO method are analyzed as follows. evaluation metrics. The modularity Q is defined in (15).
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1783
Fig. 4. The flow chart of the proposed DMFO method. It can be seen from the figure that the overall DMFO framework is mainly composed of two redesigned
processes (i.e. SFG and SFS processes).
Pc A Pc B
2 Cij log Cij N=Ci: C:j
TABLE I
NMI ¼ PcA
i¼1 j¼1
PcB (16)
i¼1 Ci: log ðCi: =N Þ þ
THE RELEVANT INFORMATION OF THE REAL-WORLD NETWORKS
j¼1 C:j log C:j =N
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1784 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
Fig. 5. The NMI values obtained by running 10 times on GN extended Fig. 6. The boxplot of the statistic values with modularity (Q) obtained by
benchmark datasets for different algorithms. The figure indicates the superior DMFO on the GN extended benchmark networks. It can be seen from the
performance of our DMFO than other methods on synthetic datasets. figure that our DMFO method has better robustness in network clustering.
TABLE II
of each vertice in the same cluster surpasses that of other RESULTS OBTAINED BY THE COMPARED ALGORITHMS ON G1 . THE RESULTS
neighbors in other clusters, and the networks have a clear SHOW THAT DMFO CAN FIND THE REAL COMMUNITY STRUCTURE AND ALSO
GET THE BEST MODULARITY Q VALUE
community structure. When m > 0:5, the community struc-
ture will become exceedingly blurred. Therefore, the mixing
parameters m ranges from 0.0 to 0.5, with a spacing of 0.05,
giving rise to a total of 11 synthetic networks in our experi-
ments. We test the proposed algorithm on this set of networks
and utilize NMI and Q value as criteria to estimate the parti-
tioning performance.
Fig. 5 provides the statistical results of the NMI over 10
runs for each algorithm with varying mixing parameters from
0 to 0.5. As can be seen, the proposed DMFO algorithms can
detect the network with the mixing parameter m being 0.45,
and the rest of the algorithms cannot detect it. Among them,
GA and MOGA-Net algorithms have the worst performance,
whose detection scope is restricted to when m is below 0.15.
When m > 0:35, GA-Net completely fails. For the MODPSO,
QDMPSO, MODBSA/D, and MOPSO-Net algorithms, the
performance is similar, and the real division of the network
community below 0.45 can be detected. To summarize, our
proposed DMFO algorithm surpasses the extended GN bench-
mark networks in performance. the administrator and the instructor in the period. Table II
Fig. 6 plots statistical results of Q obtained by DMFO algo- shows the experimental results for Q and NMI of different
rithm on the extended GN-benchmark datasets. As can be algorithms. Results show that the average NMI of MODPSO,
seen from Fig. 6, as the mixing parameter m gradually MODBSA/D, MOPSO-Net, DMFWA, MOCD-ACO, DIM-
increases from 0 to 0.5, the Q value obtained by the algorithm MOEA/D, and DMFO algorithm is 1, which means that real
decreases linearly. Overall, the robustness of the DMFO algo- clusters can be mined by them. The GA-Net, MOGA-Net, and
rithm is very strong, especially when m < 0:4, the standard DCRO methods cannot achieve the accurate division. It can
deviation of the Q value is 0. be seen from this network that our DMFO is effective and
2) Experiments on Real-World Networks: In this subsec- robust.
tion, we estimate the network clustering performance of G2 is an animal social network consisting of 62 bottlenose
DMFO on seven popular real-world networks. The results dolphins living off Doubtful Sound, New Zealand over 7
obtained by our proposed DMFO are represented in boldface. years, based on the social behavior of dolphins [49]. G2 is nat-
G1 is a social network of friendships among 34 members of urally divided into two large clusters with gender(i.e., the
a karate club spanning 2 years [48]. The club is divided into female and the male). In Table III, the results of different
two groups of the same size due to the inconsistence between methods on G2 are reported. As it can be seen from the results,
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1785
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1786 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
TABLE VI
RESULTS OBTAINED BY THE COMPARED ALGORITHMS ON G5 , G6 AND G7 IN TERMS OF THE MODULARITY Q. IT CAN BE SEEN FROM THE TABLE THAT THE PRO-
POSED DMFO METHOD CAN OBTAIN THE OPTIMAL MODULARITY AMONG ALL COMPARISON ALGORITHMS
TABLE VII can calculate the optimal average value of Q successfully on four
COMPARISON RESULTS IN Q AND NMI ON THE FOUR REAL-WORD DATA- real-world networks. Moreover, DMFO and DMFO-origin can
SETS. THE TABLE INDICATES THAT OUR DMFO METHOD USING A TCHEBY- obtain NMI=1 on both G1 and G2 networks, which verifies the
CHEFF DECOMPOSITION METHOD WITH AN l2 -NORM CONSTRAINTS CAN
OBTAIN FURTHER IMPROVED NETWORK CLUSTERING RESULTS THAN THE high clustering accuracy of the proposed method. Additionally,
DMFO-ORIGIN USING AN ORIGINAL DECOMPOSITION METHOD our DMFO using an improved Tchebycheff decomposition
method with an l2 -norm constraint on direction vectors can effec-
tively find the optimal cluster partition results compared to an
original Tchebycheff method.
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
LI et al.: DISCRETE MOTH-FLAME OPTIMIZATION WITH AN L_2-NORM CONSTRAINT FOR NETWORK CLUSTERING 1787
V. CONCLUSION
We propose a decomposition-based multi-objective discrete
moth-flame optimization method (DMFO) to optimize net-
work clustering. In detail, what is novel in the proposed
method is that a good tradeoff has been achieved between
division, that is, NMI is equal to 1. When m is greater than exploration and exploitation when clustering networks by
0.15, it is difficult for MFO-SFG to find the real partition. redefining the two processes of MFO algorithm, which ensures
Both Q and NMI values of MFO-SFG dropped rapidly. How- the better clustering results. Additionally, the improved Tche-
ever, the performance of MFO-SFS and DMFO has not bycheff decomposition method (2-Tch) divides and conquers
changed much. When m is equal to 0.3, two methods can still the network clustering problem in our algorithm. Experimen-
find close to the real community division. When m¼0.35, the tal results based on synthetic and real-world datasets can prove
performance of MFO-SFS begins to degrade. The average Q the superiority of DMFO method than other comparison algo-
value of MFO-SFS falls below 0.35, and the average NMI rithms. Moreover, the Tchebycheff method with an l2 -norm
falls below 0.53. Conversely, DMFO performs well even constraint on direction vectors can obtain better clustering
when m is 0.45, specifically, its average NMI is equal to 1 results than the original method.
and its average Q is greater than 0.51. There still remains some work related with DMFO that
In order to further study the performance differences of deserves to be further investigated. The proposed DMFO has
three algorithms, we perform the Wilcoxon signed rank shown the effectiveness of detecting single-layer network
test [54] on the results obtained by all three algorithms. The clustering. It is interesting to extend it to community detection
results are listed in Table VIII. According to the p value, it in multi-layer complex networks, because most real-world
can be seen that DMFO shows a significant improvement over communities are multi-layered. Moreover, it is worth noting
the MFO-SFS and MFO-SFG algorithms, with a level of sig- that the computational efficiency of ultra-large-scale networks
nificance a¼0.05. While the MFO-SFS algorithm shows a sig- algorithms used for community detection is usually low.
nificant improvement over the MFO-SFG algorithm, with Therefore, in the future work, how to design an optimized var-
a¼0:05. In other words, DMFO performs the best among the iant of moth-flame to effectively reduce the computational
three algorithms. Compared with MFO-SFG, DMFO can efficiency of the algorithm without affecting the accuracy of
obtain better NMI and better modularity. This result shows clustering will be the focus of research.
that although the chromosomes generated by the crossover
and mutation strategies in Algorithm 1 can expand the new REFERENCES
search space, the population begins to converge and the struc- [1] F. Chiti and C. M. Dobson, “Protein misfolding, amyloid formation, and
ture of chromosomes begins to become similar after constant human disease: A summary of progress over the last decade,” Annu.
Rev. Biochem., vol. 86, no. 1, pp. 27–68, Jun. 2017.
calculation, which cause the chromosomes generated by cross- [2] C. Gao and J. Liu, “Network-based modeling for characterizing human
over operation are still similar. Whats more, in the late stage collective behaviors during extreme events,” IEEE Trans. Syst., Man,
of the calculation, due to the high fitness of the entire popula- Cybern., Syst., vol. 47, no. 1, pp. 171–183, Jan. 2017.
[3] E. Strano, M. P. Viana, A. Sorichetta, and A. J. Tatem, “Mapping road
tion, the chromosomes generated by the mutation are less network communities for guiding disease surveillance and control strat-
likely to survive the selection. Therefore, in general, although egies,” Sci. Rep., vol. 8, no. 1, Mar. 2018, Art. no. 4744.
the strategy of Algorithm 1 can prevent the algorithm from [4] E. Stalidzans, I. Mozga, J. Sulins, and P. Zikmanis, “Search for a mini-
mal set of parameters by assessing the total optimization potential for a
falling into the local optimum quickly, we still need to dynamic model of a biochemical network,” IEEE-ACM Trans. Comput.
improve the diversity of algorithm, so as to guide Algorithm 1 Biol. Bioinf., vol. 14, no. 4, pp. 978–985, Jul./Aug. 2017.
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.
1788 IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, VOL. 9, NO. 3, MAY/JUNE 2022
[5] K. He, Y. Li, S. Soundarajan, and J. E. Hopcroft, “Hidden community [31] Y. Xu and H. Chen, “An efficient chaotic mutative moth-flame-inspired
detection in social networks,” Inf. Sci., vol. 425, pp. 92–106, Jan. 2018. optimizer for global optimization tasks,” Expert Syst. Appl., vol. 192,
[6] L. Sun, J. G. Jin, K. W. Axhausen, D.-H. Lee, and M. Cebrian, pp. 135–155, Sep. 2019.
“Quantifying long-term evolution of intra-urban spatial interactions,” J. [32] Z. T. Li, J. Liu, and K. Wu, “A multiobjective evolutionary algorithm
Roy. Soc. Interface, vol. 12, no. 102, Jan. 2015, Art. no. 20141089. based on structural and attribute similarities for community detection in
[7] F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi, attributed networks,” IEEE Trans. Cybern., vol. 48, no. 7, pp. 1963–1976,
“Defining and identifying communities in networks,” Proc. Nat. Acad. Jul. 2018.
Sci. USA, vol. 101, no. 9, pp. 2658–2663, Feb. 2004. [33] C. Gao, Z. Chen, X. Li, and Z. Wang, “Multiobjective discrete particle
[8] M. E. J. Newman, “Fast algorithm for detecting community structure in swarm optimization for community detection in dynamic networks,”
networks,” Phys. Rev. E, vol. 69, no. 6, Jun. 2004, Art. no. 066133. Europhysics Lett., vol. 122, no. 2, Apr. 2018, Art. no. 28001.
[9] X. Zhang, K. Zhou, H. Pan, L. Zhang, X. Zeng, and Y. Jin, “A network [34] C. Gao, M. X. Liang, and X. H. Li, “Network community detection
reduction-based multiobjective evolutionary algorithm for community based on the Physarum-inspired computational framework,” IEEE-ACM
detection in large-scale complex networks,” IEEE Trans. Cybern., Trans. Comput. Biol. Bioinform., vol. 15, no. 6, pp. 1916–1928, Nov.
vol. 50, no. 2, pp. 703–716, Feb. 2020. 2018.
[10] M. Girvan and M. E. J. Newman, “Community structure in social and [35] C. Pizzuti and A. Socievole, “Multiobjective optimization and local
biological networks,” Proc. Nat. Acad. Sci. USA, vol. 99, no. 12, merge for clustering attributed graphs,” IEEE Trans. Cybern., vol. 50,
pp. 7821–7826, Jun. 2002. no. 12, pp. 4997–5009, Dec. 2020.
[11] B. K. Patra, S. Nandi, and P. Viswanath, “A distance based clustering [36] Z. Li and J. Liu, “A multi-agent genetic algorithm for community detec-
method for arbitrary shaped clusters in large datasets,” Pattern Recog- tion in complex networks,” Phys A. Statist. Mech. Appl., vol. 499, no. 2,
nit., vol. 44, no. 12, pp. 2862–2870, Dec. 2011. pp. 336–347, 2016.
[12] M. Guendouz, A. Amine, and R. M. Hamou, “A discrete modified fire- [37] M. Gong, L. Ma, Q. Zhang, and L. Jiao, “Community detection in net-
works algorithm for community detection in complex networks,” Appl. works by using multiobjective evolutionary algorithm with decom-
Intell., vol. 46, no. 2, pp. 373–385, Mar. 2017. position,” Phys. A, vol. 391, no. 15, pp. 4050–4060, Aug. 2012.
[13] C. Pizzuti, “GA-Net: A genetic algorithm for community detection in [38] R. Shang, S. Luo, W. Zhang, R. Stolkin, and L. Jiao, “A multiobjective
social networks,” in Proc. 10th Int. Conf. Parallel Problem Solving evolutionary algorithm to find community structures based on affinity
Nature, 2008, pp. 1081–1090. propagation?,” Phys. A, vol. 453, pp. 203–227, Jul. 2016.
[14] H. H. Chang, Z. R. Feng, and Z. G. Ren, “Community detection using [39] F. Zou, D. Chen, S. Li, R. Lu, and M. Lin, “Community detection in
dual-representation chemical reaction optimization,” IEEE Trans. complex networks: Multi-objective discrete backtracking search optimi-
Cybern., vol. 47, no. 12, pp. 4328–4341, Dec. 2017. zation algorithm with decomposition,” Appl. Soft. Comput., vol. 53,
[15] M. Gong, Q. Cai, X. Chen, and L. Ma, “Complex network clustering by pp. 285–295, Apr. 2017.
multiobjective discrete particle swarm optimization based on decom- [40] M. A. E. Aziz, A. A. Ewees, and A. E. Hassanien, “Whale optimization
position,” IEEE Trans. Evol. Comput., vol. 18, no. 1, pp. 82–97, algorithm and moth-flame optimization for multilevel thresholding
Feb. 2014. image segmentation,” Expert Syst. Appl., vol. 83, pp. 242–256, 2017.
[16] L. Li, L. Jiao, J. Zhao, R. Shang, and M. Gong, “Quantum-behaved dis- [41] C. Li, S. Li, and Y. Liu, “A least squares support vector machine model
crete multi-objective particle swarm optimization for complex network optimized by moth-flame optimization algorithm for annual power load
clustering,” Pattern Recognit., vol. 63, pp. 1–14, Mar. 2017. forecasting,” Appl. Intell., vol. 45, no. 4, pp. 1166–1178, 2016.
[17] S. Rahimi, A. Abdollahpouri, and P. Moradi, “A multi-objective particle [42] M. Wang et al., “Toward an optimal kernel extreme learning machine
swarm optimization algorithm for community detection in complex using a chaotic moth-flame optimization strategy with applications in
networks,” Swarm Evol. Comput., vol. 39, pp. 297–309, Apr. 2018. medical diagnoses,” Neurocomputing, vol. 267, no. 6, pp. 69–84, 2017.
[18] P. Ji, S. Zhang, and Z. Zhou, “A decomposition-based ant colony opti- [43] J. Handl and J. Knowles, “An evolutionary approach to multi-objective
mization algorithm for the multi-objective community detection,” J. clustering,” IEEE Trans. Evol. Comput., vol. 11, no. 1, pp. 56–76,
Ambient Intell. Humanized Comput., vol. 11, no. 1, pp. 173–188, Feb. 2007.
Jan. 2020. [44] M. Gong, Q. Cai, Y. Li, and J. Ma, “An improved memetic algorithm for
[19] F. Zou, D. Chen, D. Huang, R. Lu, and X. Wang, “Inverse modelling- community detection in complex networks,” in Proc. IEEE Congr. Evol.
based multi-objective evolutionary algorithm with decomposition for Comput., 2012, pp. 1–8.
community detection in complex networks,” Phys. A, vol. 513, pp. 662– [45] J. Kennedy and R. C. Eberhart, “A discrete binary version of the particle
674, Jan. 2019. swarm algorithm,” in Proc. IEEE Int. Conf. Syst., Man, Cybern.1997,
[20] C. Pizzuti, “A multiobjective genetic algorithm to find communities in pp. 4104–4108.
complex networks,” IEEE Trans. Evol. Comput., vol. 16, no. 3, pp. 418– [46] Q. Zhang and H. Li, “MOEA/D: A multi-objective evolutionary algo-
430, Jun. 2012. rithm based on decomposition,” IEEE Trans. Evol. Comput., vol. 11,
[21] X. Liu, F. Zhang, X. Li, C. Gao, and J. Liu, “Multi-objective discrete no. 6, pp. 712–731, Dec. 2007.
moth-flame optimization for complex network clustering,” in Proc. 25th [47] X. Ma, Q. Zhang, G. Tian, J. Yang, and Z. Zhu, “On Tchebycheff
Int. Symp. Methodologies Intell. Syst., 2020, pp. 372–382. decomposition approaches for multi-objective evolutionary opti-
[22] G. Wang and S. Deb, “Monarch butterfly optimization,” Neural Comput. mization,” IEEE Trans. Evol. Comput., vol. 22, no. 2, pp. 226–244,
Appl., vol. 31, pp. 1995–2014, 2019. Apr. 2018.
[23] A. Heidari and S. Mirjalili, “Harris hawks optimization: Algorithm and [48] J. Duch and A. Arenas, “Community detection in complex networks
applications,” Future Gener. Comput. Syst., vol. 97, pp. 849–872, 2019. using extremal optimization,” Phys. Rev. E, vol. 72, no. 2, Aug. 2005,
[24] S. Li and H. Chen, “Slime mould algorithm: A new method for stochastic Art. no. 027104.
optimization,” Future Gener. Comput. Syst., vol. 111, pp. 300–323, 2020. [49] D. Lusseau, K. Schneider, O. J. Boisseau, P. Haase, E. Slooten, and
[25] Y. Yang and H. Chen, “Hunger games search: Visions, conception, S. M. Dawson, “The bottlenose dolphin community of doubtful sound
implementation, deep analysis, perspectives, and towards performance features a large proportion of long-lasting associations,” Behav. Ecol.
shifts,” Expert Syst. with Appl., vol. 177, 2021, Art. no. 114864. Sociobiol., vol. 54, no. 4, pp. 396–405, Sep. 2003.
[26] G. Wang, “Moth search algorithm: A bio-inspired metaheuristic algo- [50] M. Girvan and M. E. J. Newman, “Community structure in social and bio-
rithm for global optimization problems,” Memetic Comput., vol. 10, logical networks,” in Proc. Nat. Acad. Sci. USA, 2002, pp. 7821–7826.
no. 2, pp. 151–164, Jun. 2018. [51] L. N. F. Ana and A. K. Jain, “Robust data clustering,” in Proc. IEEE
[27] S. Mirjalili, “Moth-flame optimization algorithm: A novel nature- Comput. Soc. Conf. Comput. Vis. Pattern Recognit., 2003, pp. 128–133.
inspired heuristic paradigm,” Knowl.-Based Syst., vol. 89, pp. 228–249, [52] R. Guimer, L. Danon, A. Daz-Guilera, F. Giralt, and A. Arenas, “Self-
Nov. 2015. similar community structure in a network of human interactions,” Phys.
[28] C. Li, Z. Niu, Z. Song, B. Li, J. Fan, and P. X. Liu, “A double evolution- Rev. E, vol. 68, no. 6, Dec. 2003.
ary learning moth-flame optimization for real-parameter global optimi- [53] L. Danon, A. Dıaz-Guilera, J. Duch, and A. Arenas, “Comparing com-
zation problems,” IEEE Access, vol. 6, pp. 76700–76727, 2018. munity structure identification,” J. Statist. Mech., vol. 2005, no. 09,
[29] Y. Xu and H. Chen, “Enhanced moth-flame optimizer with mutation strat- pp. P09008–P09008, Sep. 2005.
egy for global optimization,” Inf. Sci., vol. 492, pp. 181–203, Aug. 2019. [54] J. Derrac, S. Garca, D. Molina, and F. Herrera, “A practical tutorial on
[30] W. Shan and Z. Qian, “Double adaptive weights for stabilization of the use of nonparametric statistical tests as a methodology for comparing
moth flame optimizer: Balance analysis, engineering cases, and medical evolutionary and swarm intelligence algorithms,” Swarm Evol. Comput.,
diagnosis,” Knowl.-Based Syst., vol. 214, Feb. 2021, Art. no. 106728. vol. 1, no. 1, pp. 3–18, 2011.
Authorized licensed use limited to: University Town Library of Shenzhen. Downloaded on November 08,2023 at 08:42:16 UTC from IEEE Xplore. Restrictions apply.