Audet 2020 Metrics
Audet 2020 Metrics
Invited Review
a r t i c l e i n f o a b s t r a c t
Article history: In recent years, the development of new algorithms for multiobjective optimization has considerably
Received 3 December 2018 grown. A large number of performance indicators has been introduced to measure the quality of Pareto
Accepted 11 November 2020
front approximations produced by these algorithms. In this work, we propose a review of a total
Available online 18 November 2020
of 63 performance indicators partitioned into four groups according to their properties: cardinality,
Keywords: convergence, distribution and spread. Applications of these indicators are presented as well.
Multiobjective optimization © 2020 Elsevier B.V. All rights reserved.
Quality indicators
Performance indicators
https://doi.org/10.1016/j.ejor.2020.11.016
0377-2217/© 2020 Elsevier B.V. All rights reserved.
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
In Jiang, Ong, Zhang, and Feng (2014), an empirical study fo- • y1 < y2 (y1 strictly dominates y2 ) if and only if y1i < y2i for all
cuses on the correlations between different indicators with their i = 1, 2, . . . , m.
computational complexity on concave and convex Pareto fronts.
Finally, the usage of indicators proposed by the multiobjective In the case when neither y1 y2 nor y2 y1 , y1 and y2 are said
evolutionary optimization community prior to 2013 is analyzed to be incomparable.
in Riquelme, Lücken, and Baran (2015). We can now present the concept of dominance relations for the
Another survey Li and Yao (2019) on the quality evaluation of decision vectors.
solution sets was recently published. It complements the present
study by categorizing more performance indicators, but presents Definition 2 (Dominance relations for decision vectors Audet,
them at a higher level of description. Most of the indicators in Savard, & Zghal, 2008). Given two decision vectors x1 and x2 in
the present work are listed in Li and Yao (2019), but not all of the feasible set X ⊆ Rn , we write:
them (e.g., the ones proposed in Custódio, Madeira, Vaz, & Vicente, • x1 x2 (x1 weakly dominates x2 ) if and only if F (x1 ) F (x2 ).
2011). While Li and Yao (2019) is mainly oriented toward the evo- • x1 ≺ x2 (x1 dominates x2 ) if and only if F (x1 ) ≤ F (x2 ).
lutionary algorithms community, the present work addresses the • x1 ≺≺ x2 (x1 strictly dominates x2 ) if and only if F (x1 ) < F (x2 ).
whole operational research community, and also addresses some • x1 x2 (x1 and x2 are incomparable) if neither x1 weakly dom-
issues in more detail, such as complexity computational costs, as
inates x2 nor x2 weakly dominates x1 .
well as data and performance profiles. The reader is invited to con-
sult their survey as a useful complement. With these relations, we now precise the concept of solution in
The present work is an attempt to propose a survey offering a the multiobjective optimization framework.
panorama on all important aspects of performance indicators con-
trary to the previous surveys, addressed to the whole multiobjec- Definition 3 (Pareto optimality and Pareto solutions Ehrgott,
tive optimization community. This work systematically analyzes 63 2005). The vector x ∈ X is a Pareto-optimal solution if there is no
performance indicators by partitioning them into these four cat- other vector in X that dominates it. The set of Pareto-optimal so-
egories: Cardinality, Convergence, Distribution and spread, Conver- lutions is called the Pareto set, denoted XP , and the image of the
gence and distribution. The use of performance metrics targets four Pareto set is called the Pareto front, denoted YP .
cases: comparison of algorithms, embedding of performance indi- In single-objective optimization, the set of optimal solutions
cators into multiobjective methods, suggestion of stopping crite- is often composed of a singleton. In the multiobjective case, the
ria for multiobjective optimization and identification of promising Pareto front usually contains many elements (an infinity in con-
distribution-based performance indicators. Table 1 lists these indi- tinuous optimization and an exponential number in discrete op-
cators and their category, classifies them based on their properties, timization Ehrgott, 2005). For a problem with m objectives, the
and indicates the section in which they are discussed. This work is Pareto front YP is at most of dimension m − 1. For three objectives,
organized as follows. Section 2 introduces the notations and defi- the Pareto front is a surface, a curve, a point, or combinations of
nitions related to multiobjective optimization and performance in- surfaces, curves and/or points, or the empty set. For two objec-
dicators. Section 3 is the core of this work, and is devoted to clas- tives, the Pareto front can be a curve or a point or a combination
sification of the indicators according to their specific properties. Fi- of curves and/or points, or the empty set. Also, it is interesting to
nally, Section 4 presents some applications. define some bounds on this set.
2. Notations and definitions Definition 4 (Ideal and nadir objective vectors). The ideal objec-
tive vector yI Ehrgott (2005) is defined as the objective vector
To apprehend performance indicators, the first part of this sec- whose components are the solutions of each single-objective prob-
tion describes the main concepts related to multiobjective opti- lem min fi (x ), i = 1, 2, . . . , m. The nadir objective vector yN is de-
x∈X
mization. The second part focuses on the theory of Pareto set ap- fined as the objective vector whose components are the solutions
proximations and performance indicators. of the single-objective problems max fi (x ), i = 1, 2, . . . , m.
x∈XP
2.1. Multiobjective optimization and Pareto dominance For computational reasons, the nadir objective vector is often
approximated by yN for which the coordinates are defined the fol-
We consider the following continuous multiobjective optimiza- lowing way: let xk, be the solution of the single-objective problem
tion problem: min fk (x ) for k = 1, 2, . . . , m. The ith coordinate of
yN is given by:
x∈X
min F (x ) = [ f1 (x ) f2 (x ) . . . fm (x )]
yNi = max fi (xk, ).
x∈X
k=1,2,...,m
where X ⊆ Rn = ∅ is called the feasible set, and fi : Rn → R are m
For a biobjective optimization problem, yN equals yN . It is not al-
objective functions for i = 1, 2, . . . , m, with m ≥ 2. The image of the
ways the case when m ≥ 3.
feasible set Y = {F (x ) ∈ Rm : x ∈ X } is called the (feasible) objective
An illustration is given in Fig. 1 where the Pareto front is piece-
set. The sets Rn and Rm are respectively denoted as the decision
wise continuous. To simplify the notation, continuous Pareto and
space and the objective space.
piecewise continuous Pareto fronts will be respectively designed
To compare functions objective values, the following cone order
as continuous and discontinuous Pareto fronts.
relation is adopted (Ehrgott, 2005).
Remark. In a multiobjective optimization problem, objectives are
Definition 1 (Dominance relations between objective vec-
not necessarily contradictory, and the set of Pareto solutions may
tors Dächert, Klamroth, Lacour, & Vanderpooten, 2017). Given
be a singleton. In this study, we assume that this is not the case.
two objective vectors y1 and y2 in the objective space Rm , we
write:
2.2. Approximation sets and performance indicators
• y1 y2 (y1 weakly dominates y2 ) if and only if y1i ≤ y2i for all
i = 1, 2, . . . , m. Generally, whether in the context of continuous or discrete op-
• y1 ≤ y2 (y1 dominates y2 ) if and only if y1 y2 and y1 = y2 . timization, it is not possible to find or enumerate all elements
398
C. Audet, J. Bigeon, D. Cartier et al.
Table 1
A summary of performance indicators. The 9 rightmost columns indicate references where the indicators are presented.
Category Performance indicators Sect. Cheng et al. Collette Knowles Zitzler et al. Okabe et al. Faulkenberg Jiang et al. Riquelme Li and Yao
(2012) and Siarry and Corne (2003) (2003) and Wiecek (2014) et al. (2015) (2019)
(2011) (2002) (2009)
√ √ √ √ √ √ √
Cardinality 3.1 C-metric/Two sets Coverage Zitzler and Thiele (1998) 3.1.5
√ √ √ √ √ √
Error ratio Van Veldhuizen (1999) 3.1.4
√ √ √ √
Generational non dominated vector 3.1.3
generation Van Veldhuizen and Lamont (2000)
√ √ √
Generational non dominated vector generation 3.1.3
ratio Van Veldhuizen and Lamont (2000)
√
Mutual domination rate Martí et al. (2016) 3.1.6
√ √ √
Nondominated vector additional Van Veldhuizen and 3.1.3
Lamont (2000)
√ √ √ √ √ √ √ √
Overall nondominated vector generation Van Veldhuizen 3.1.1
(1999)
√ √ √ √ √ √ √
Overall nondominated vector generation 3.1.2
ratio Van Veldhuizen (1999)
√ √ √
Ratio of non-dominated points by the reference 3.1.5
set Hansen and Jaszkiewicz (1998)
√ √ √
399
Category Performance indicators Sect. Cheng et al. Collette Knowles Zitzler et al. Okabe et al. Faulkenberg Jiang et al. Riquelme Li and Yao
(2012) and Siarry and Corne (2003) (2003) and Wiecek (2014) et al. (2015) (2019)
(2011) (2002) (2009)
√ √
Evenness Messac and Mattson (2004) 3.3.7
√ √
Extension Meng et al. (2005) 3.3.14
√
-metric Custódio et al. (2011) 3.3.3
√ √ √ √
Hole Relative Size Collette and Siarry (2005) 3.3.4
√
Laumanns metric Laumanns et al. (2000) 3.3.17
√
Modified Diversity indicator Asafuddoula et al. (2015) 3.3.18
√ √ √ √ √ √
M2 -metric Zitzler et al. (2000) 3.3.5
√ √ √ √ √ √ √ √
M3 -metric Zitzler et al. (2000) 3.3.5
√ √ √ √ √
Number of distinct choices Wu and Azarm (2000) 3.3.18
√ √
Outer diameter Zitzler et al. (2008) 3.3.11
√ √ √ √ √ √
Overall Pareto Spread Wu and Azarm (2000) 3.3.10
Riesz S-Energy Hardin and Saff (2004) 3.3.16
√ √
Sigma diversity metric Mostaghim and Teich (2005) 3.3.18
√ √ √ √ √ √ √ √
Spacing Schott (1995) 3.3.1
√ √
U-measure Leung and Wang (2003) 3.3.9
√
Uniform assessment metric Li et al. (2008) 3.3.13
√ √ √
Uniform distribution Tan et al. (2002) 3.3.5
400
√ √
Uniformity Sayın (2000) 3.3.6
√ √
Uniformity Meng et al. (2005) 3.3.8
√ √
Convergence and Averaged Hausdorff distance Schutze et al. (2012) 3.4.2
distribution 3.4 Cone-based hypervolume Emmerich et al. (2013b) 3.4.7
√
Dominance move Li and Yao (2017) 3.4.6
√ √ √ √ √ √
D-metric/Difference coverage of two sets Zitzler (1999) 3.4.7
√ √ √ √ √
DR -metric Czyzzak and Jaszkiewicz (1998) 3.4.3
√ √ √ √ √
Hyperarea difference Wu and Azarm (2000) 3.4.7
√ √ √ √ √ √ √ √ √
Hypervolume indicator (or S-metric) Zitzler et al. (2000) 3.4.7
Table 2
Comparison relations between Pareto front approximations Zitzler et al. (2003). Note that YN1 ≺≺ YN2 ⇒ YN1 ≺ YN2 ⇒ YN1 YN2 ⇒ YN1 YN2 .
1 2
Relation Objective vectors y and y Pareto front approximations YN1 and YN2
Strictly dominates y1 < y2 y1 is better than y2 in all objectives YN1 ≺≺ YN2 Every y2 ∈ YN2 is strictly dominated
by at least one y1 ∈ YN1
Dominates y1 ≤ y2 y1 is not worse than y2 in all YN1 ≺ YN2 Every y2 ∈ YN2 is dominated by at
objectives and better in at least one least one y1 ∈ YN1
objective
Weakly dominates y1 y2 y1 is not worse than y2 in all YN1 YN2 Every y2 ∈ YN2 is weakly dominated
objectives by at least one y1 ∈ YN1
Is better YN1 YN2 Every y2 ∈ YN2 is weakly dominated
by at least one y1 ∈ YN1 and YN1 = YN2
Is incomparable Neither y1 weakly dominates y2 nor YN1 YN2 Neither YN1 weakly dominates YN2
y2 weakly dominates y1 nor YN1 weakly dominates YN2
401
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
YN1 R YN2 ⇒ CI,E (YN1 , YN2 ) for all YN ∈ , ONV G(YN ) = |YN |.
This indicator is to be maximized. Nonetheless, this is not a perti-
or for any YN1 , YN2 Pareto front approximations, we have:
nent measure. For example, consider a Pareto front approximation
YN1 composed of one million non-dominated points and a Pareto
YN2 R YN1 ⇒ CI,E (YN1 , YN2 ). front approximation YN2 with only one point, such as this point
dominates all the other points of YN1 . YN1 outperforms YN2 for this
For any Pareto front approximations YN1 , YN2 ∈ , there are no indicator but YN2 is clearly better than YN1 (Knowles & Corne, 2002).
combination I of unary performance indicators such that YN1
YN2 ⇔ CI,E (YN1 , YN2 ) (Zitzler et al., 2003). 3.1.2. Overall Non-dominated vector generation ratio (ONVGR)
The mathematical properties of the performance indicators (Van Veldhuizen, 1999)
mentioned in this survey are summarized in Tables 3, 4 and 5 in ONV GR represents the ratio of a number of elements in the
the appendices. Pareto front approximation with respect to a Pareto optimal solu-
tion set. Formally,
Remark. Throughout the rest of the paper, the following nota-
|YN |
tions will be used. A discrete representation of the Pareto front is ONV GR(YN ; YP ) =
denoted by YP ⊆ YP , called the Pareto optimal solution set (Okabe |YP |
et al., 2003). where |YP | is the cardinality of a Pareto optimal solution set and
The Pareto front approximation at iteration k will be denoted by |YN | the number of points of the Pareto front approximation. A
YN (k ). In many cases, the Pareto front is unknown. The user needs higher value is considered to be better. Note that this indicator is
to specify a set of objective vectors in the objective space, called just ONV G (3.1.1) divided by a scalar. Consequently, it suffers from
a reference set and denoted by YR ⊆ Rm . Note that a Pareto front the same drawbacks as the previous indicator.
(approximated or not) contains only feasible objective vectors, i.e.
each element of a Pareto front approximation belongs to Y. It im- 3.1.3. Generational indicators (GNVG, GNVGR and NVA)
plies that if an algorithm does not find any feasible points then (Van Veldhuizen, 1999)
YN (k ) is empty. For the following definitions to apply, we impose GNV G(YN ; k ) (generational non-dominated vector generation)
that the iteration counter k is set to 0 at the iteration where a first returns the number of non-dominated points |YN (k )| generated at
feasible point has been found. iteration k for a given iterative algorithm. GNV GR(YN ; YP , k ) (gen-
erational non-dominated vector generation ratio) is the ratio of
402
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
non-dominated points |YN (k )| generated at iteration k over the car- it gives some unreliable results. Moreover, it does not give an indi-
dinality of YP where YP is a set of points from the Pareto front. cation of “how much” a Pareto front approximation strictly domi-
NVA(YN ; k ) (non-dominated vector addition) represents the varia- nates another.
tion of non-dominated points in the objective space generated be- Similarly to the C-metric, given a reference set YR in the ob-
tween successive iterations. It is given by: jective space, the C2R metric (Ratio of non-dominated points by the
reference set) introduced in Hansen and Jaszkiewicz (1998) is given
NVA(YN ; k ) = |YN (k )| − |YN (k − 1 )| for k > 0.
by:
These indicators can be used to follow the evolution of the gen-
|{y ∈ YN : there does not exist r ∈ YR such that r ≤ y}|
eration of non-dominated points along iterations of a given algo- C2R (YN ; YR ) = .
|YN |
rithm. It seems difficult to use them as a stopping criterion as the
number of non-dominated points can evolve drastically between A higher C2R value is considered to be better. This indicator has the
two iterations. same drawbacks as the C-metric.
3.1.4. Error ratio (ER) (Van Veldhuizen, 1999) 3.1.6. Mutual domination rate (MDR) (Martí, García, Berlanga, &
This indicator considers the number of non-dominated objec- Molina, 2016)
tive vectors which belong to the Pareto front. It is given by the The authors of Martí et al. (2016) use this performance indica-
following formula: tor in combination with a Kalman filter to monitor the progress
of evolutionary algorithms along iterations and thus providing
1
E (YN ) = 1Rm \YP (y ) a stopping criterion. At a given iteration k, MDR captures how
|YN | y∈YN many non-dominated points at iteration k − 1 are dominated by
non-dominated points generated at iteration k and reciprocally.
where for all y ∈ Rm ,
Given two Pareto front approximations YN1 and YN2 , the function
0 if y belongs to the Pareto front. (YN1 , YN2 ) returns the set of elements of YN1 that are dominated
1Rm \YP (y ) =
1 otherwise. by at least one element of YN2 . Formally,
The lower the indicator value, the better it is considered. |(YN (k − 1 ), YN (k ) )| |(YN (k ), YN (k − 1 ) )|
MDR(YN ; k ) = −
In Van Veldhuizen (1999), the author does not mention the |YN (k − 1 )| |YN (k )|
presence of rounding errors in their indicator. A suggestion should
where YN (k ) is the Pareto front approximation generated at it-
be to consider an external accuracy parameter , quantifying the
eration k. If MDR(YN ; k ) = 1, the set of non-dominated points
belonging of an element of the Pareto front approximation to the
at iteration k totally dominates its predecessor at iteration k −
Pareto front with near to correct rounding errors.
1. If MDR(YN ; k ) = 0, no significant progress has been observed.
This indicator requires the analytical expression of the Pareto
MDR(YN ; k ) = −1 is the worst case, as it results in a total loss of
front. Consequently, an user can only use it on analytical bench-
domination at the current iteration.
mark tests. Moreover, this indicator depends mostly on the cardi-
Cardinality indicators have a main drawback. They fail to quan-
nality of the Pareto front approximation, which can misguide inter-
tify how well-distributed the Pareto front approximation is, or to
pretations. Knowles and Corne (2002) illustrate this drawback with
quantify how it converges during the course of an algorithm.
the following example. Consider two Pareto front approximations.
The first one has 100 elements, one in the Pareto front and the
3.2. Convergence indicators
others close to it. Its error ratio is equal to 0.99. The second one
has only two elements, one in the Pareto front, the other far from
Most of these measures require the knowledge of the Pareto
it. Its ratio is equal to 0.5. It is obvious that the first Pareto front
Front to be evaluated. They capture the degree of proximity be-
approximation is better, even if its error ratio is bad. However, it is
tween a Pareto front and its approximation.
straightforward to compute.
Similarly to the error ratio measure, Hansen and Jaszkiewicz
3.2.1. Generational distance (GD) (Van Veldhuizen, 1999)
(1998) defines the C1R metric (called also ratio of the reference
The GD indicator captures the average distance between each
points). Given a reference set YR (chosen by the user) in the objec-
element of a Pareto front approximation and its closest neighbor
tive space, it is the ratio of the number of objectives vectors found
in a discrete representation of the Pareto front. This indicator is
in YR over the cardinality of the reference set YR . A higher value is
given by the following formula:
desirable.
1
p
1
3.1.5. C-metric or coverage of two sets (C) (Zitzler, 1999) GD(YN ; YP ) = min y − y1 2 p
Let YN1 and YN2 be two Pareto front approximations. The C-metric |YN | y1 ∈YN
y2 ∈YP
403
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Siarry (2011), it could be used as a stopping criteria. A slight varia- 3.2.5. Progress metric (Pg ) (Van Veldhuizen, 1999)
tion of the generational distance GD(YN (k ), YN (k + 1 ) ) between two This indicator introduced in Bäck (1996) measures the progres-
successive iterations, as long as the algorithm is running, could sion of the Pareto front approximation given by an algorithm to-
mean a convergence towards the Pareto front. It can be applied wards the Pareto front in function of the number of iterations for
on continuous and discontinuous Pareto front approximations. a given objective function i. It is defined by:
min yi
y∈YN (0)
3.2.2. Standard deviation from the generational distance (ST DGD) Pg (YN ; i, k ) = ln .
(Van Veldhuizen, 1999) min yi
y∈YN (k )
It measures the deformation of the Pareto front approxima-
tion according to a discrete representation of the Pareto front. It In Van Veldhuizen (1999), the author modifies this metric to take
is given by the following formula: into account whole Pareto front approximations:
1
2
GD(YN (0 ); YP )
ST DGD(YN ; YP ) = min y1 − y2 − GD(YN ; YP ) . RPg (YN ; YP , k ) = ln
|YN | 1 y2 ∈YP GD(YN (k ); YP )
y ∈YN
where GD(YN (k ); YP ) is the generational distance (3.2.1) of the
The same critics as with the generational distance apply. Pareto front approximation YN at iteration k.
Pg is not always defined, for example when values of
miny∈YN (0 ) yi or miny∈YN (k ) yi are negative or null. As GD is always
3.2.3. Seven points average distance (SPAD) (Schott, 1995)
positive, RPg is well defined, but it requires the knowledge of the
As it is not practical to obtain the Pareto front, an alternative is
Pareto front.
to use a reference set YR in the objective space. The SPAD indicator
Pg , when it exists, provides an estimation of the speed of con-
defined for biobjective optimization problems uses a reference set
vergence of the associated algorithm. RPg captures only the vari-
composed of seven points:
ations of the generational distance along the number of itera-
h l tions. The drawbacks of the generational distance do not apply in
YR = max f1 (x ), max f2 (x ) . this case. Finally, a bad measure of progression does not necessar-
3 x∈X 3 x∈X
0≤h,l≤3
ily mean that an algorithm performs poorly. Some methods less
SPAD captures the average distance of the elements of the refer- deeply explore the objective space, but reach the Pareto front after
ence set YR to their closest neighbor in the Pareto front approxi- a more important number of iterations.
mation. Formally,
3.2.6. -indicator (I ) (Zitzler et al., 2003)
1 An objective vector y1 ∈ Rm is -dominating, for > 0, an ob-
SPAD(YN ; YR ) = min y − r .
|YR | y∈YN
r∈YR jective vector y2 ∈ Rm if:
404
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Pareto front approximation, which can misguide quality compar- where d1 y j , YN \ {y j } = miny∈Y \{y j } y − y j 1 is the l1 distance
N
ison between different Pareto front approximations. Furthermore,
of y j ∈ YN to the set YN \ {y j } and d¯ is the mean of all d1 (y j , YN \
the -indicator focuses only on one objective when comparing dif-
{y j } ) for j = 1, 2, . . . , |YN |.
ferent objective vectors, as noticed in Li and Yao (2017). For ex-
This method cannot account for holes in the Pareto front ap-
ample, consider y1 = (0, 1, 1 ) and y2 = (1, 0, 0 ) in a tri-objective
proximation as it takes into account the distance between an ob-
maximization problem. Although y1 performs better than y2 in two
jective vector and its closest neighbor. The major issue with this
different objectives, the additive -indicators are identical:
indicator is it gives some limited information when points given
I+ {y1 }, {y2 } = I+ {y2 }, {y1 } . by the algorithm are clearly separated, but spread into multiple
groups. On the contrary, it is straightforward to compute.
On the contrary, it is straightforward to compute. It can be used
for continuous and discontinuous approximations of Pareto fronts.
3.2.7. Degree of approximation (DOA) (Dilettoso, Rizzo, & Salerno, 3.3.2. Delta indexes ( , and ) (Deb et al., 2000; Zhou, Jin,
2017) Zhang, Sendhoff, & Tsang, 2006)
By taking into account the dominance relation in the objective Deb et al. (20 0 0) introduce the index for biobjective prob-
space, DOA captures an average degree of proximity from a discrete lems, which captures the variation of distance between consecu-
representation of the Pareto front to a Pareto front approximation. tive elements of the Pareto front approximation into the biobjec-
A lower value is considered to be better. This indicator is proved tive space. Formally,
to be ≺-complete (see Definition 9). It aims to compare algorithms
when the Pareto fronts are known.
|Y
N |−1 dc y j , YN \ {y j } − d¯c
(Y )=
Given y2 an objective vector belonging to YP , the set D (y2 , YN ) N
|YN | − 1
j=1
in the objective space is defined as the subset of points belonging
to the Pareto front approximation YN dominated by the objective
where dc y j , YN \ {y j } is the Euclidean distance between consec-
vector y2 . The minimum Euclidean distance between y2 ∈ YP and
D (y2 , YN ) (which may be empty) is computed with utive elements of the Pareto front approximation YN , and d¯c the
mean of the dc y j , YN \ {y j } for j = 1, 2, . . . , |YN | − 1. As this indi-
min 2
y −y 1
if |D (y , YN )| > 0
2
cator considers Euclidean distances between consecutive objective
d (y2 , YN ) = y1 ∈D (y2 ,YN )
∞ if |D (y2 , YN )| = 0. vectors, it can be misleading if the Pareto front approximation is
piecewise continuous. The index does not generalize to more
Similarly, d + y2 , YN \ D (y2 , YN ) is defined for y2 ∈ YP by consider- than 2 objectives, as it uses lexicographic order
in the biobjective
ing the set of points of YN that do not belong to D (y2 , YN ) as: objective space to compute the dc y j , YN \ {y j } . In addition, it does
not consider the extent of the Pareto front approximation, i.e. dis-
d+ y2 , YN \ D (y2 , YN )
tances between extreme points of the Pareto front.
min ( y1 − y2 )+ if |YN \ D (y2 , YN )| > 0 The index is an indicator derived from the index to take
= y1 ∈YN \D (y2 ,YN ) into account the extent of the Pareto front approximation for biob-
∞ if |YN \ D (y2 , YN )| = 0 jective problems:
where (y1 − y2 )+ = (max(0, y1i − y2i ))i=1,2,...,m . 2 |YN |−1
The DOA indicator is finally given by min yi, − y + dc y j , YN \ {y j } − d¯c
i=1 y∈YN j=1
1 (YN ; YP ) = 2
DOA(YN ; YP ) = min d (y2 , YN ), d+ y2 , YN \ D (y2 , YN ) . min yk, − y + (|YN | − 1 )d¯c
|YP | 2 y ∈YP
k=1 y∈YN
The value of DOA does not depend on the number of points of where miny∈YN yi, − y for i = 1, 2 are the Euclidean distances be-
YP , i.e. if |YP | |YN | (Dilettoso et al., 2017). In fact, this indica- tween the extreme solutions of the Pareto front (i.e. yi, = F (xi, ) ∈
tor partitions YN into subsets in which each element is domi- YP where xi, is solution to the ith single-objective problem) and
nated by a point y ∈ YP . Its computational cost is quite low (in the boundary solutions of the Pareto front approximation. The
O (m |YN | × |YP | )). It can be used for discontinuous and continuous other notations remain the same as before. This indicator requires
approximations of Pareto fronts. the resolution of each single-objective optimization problem. This
indicator is extended to Pareto fronts with more than two objec-
3.3. Distribution and spread indicators tives by Zhou et al. (2006) to the generalized -index:
m |YN |
According to Custódio et al. (2011), “the spread metrics try to
d2 (yi, , YN ) + d2 (y j , YN \ {y j } ) − d¯2
measure the extents of the spread achieved in a computed Pareto front i=1 j=1
(YN ; YP ) = m
approximation”. They are not really useful to evaluate the conver- d2 (yi, , YN ) + |YN | d¯2
gence of an algorithm, or at comparing algorithms, but rather i=1
405
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Fig. 2. Illustration of the metric for a biobjective problem (inspired by Custódio et al., 2011).
3.3.3. Two measures proposed by Custódio et al. (2011) ( and ) 3.3.4. Hole relative size (HRS) (Collette & Siarry, 2005)
Given a Pareto set approximation XN = {x1 , x2 , . . . , xN } to which This indicator identifies the largest hole in a Pareto front ap-
two additional “extreme” points indexed by 0 and N + 1 are proximation for a biobjective problem. It is given by
added, for each objective i for i = 1, 2, . . . , m, elements x j for j =
0, 1, . . . , N + 1 of the Pareto set approximation are sorted so that,
HRS(YN ) = (1/d¯) max dj
j=1,2,...,|YN |−1
fi (x0 ) ≤ fi (x1 ) ≤ fi (x2 ) ≤ . . . ≤ fi (xN+1 ). where YN is a Pareto front approximation whose elements have
been sorted in ascendant order according to the first objective,
Custódio et al. (2011) introduce the following metric > 0 defined d j = y j − y j+1 2 is the l2 distance between the two adjacent ob-
by: jective vectors y j ∈ YN and y j+1 ∈ YN and d¯ the mean of all d j for
j = 1, 2, . . . , |YN | − 1.
(YN ) = max max δi, j A lower indicator value is desirable. As it takes into account
i∈{1,2,...,m} j∈{0,1,...,N}
holes in the objective space, this indicator is more adapted to con-
tinuous Pareto front approximations.
where δi, j = fi (x j+1 ) − fi (x j ) and YN = F (XN ). When considering a
biobjective problem (m = 2), the metric reduces to consider the
maximum distance in the infinity norm between two consecutive 3.3.5. Zitzler’s metrics M2 and M3 (Zitzler et al., 2000)
points in the Pareto front approximation as it is shown in Fig. 2. The M2 metric returns a value in the interval [0; |YN |] where YN
To take into account the extent of the Pareto front approxima- is the Pareto front approximation. It reflects the number of sub-
tion, the authors of Custódio et al. (2011) define the following in- sets of the Pareto front approximation YN of a certain radius (σ ). A
dicator by higher value is considered to be better. Its expression is given by
N−1 1
δi,0 + δi,N + δi, j − δi M2 (YN ; σ ) = |{y1 ∈ YN , y2 − y1 > σ }|.
(YN ) = max
j=1 |YN | − 1 2y ∈YN
i∈{1,2,...,m} δi,0 + δi,N + (N − 1 )δi
If M2 (YN ; σ ) = |YN |, it means that for each objective vector, no
other objective vector within the distance σ can be found. It is
where δi , for i = 1, 2, . . . , m, is the mean of all distances δi, j for j =
straightforward to compute but it can be difficult to interpret.
1, 2 . . . , N − 1.
The authors of Tan, Lee, and Khor (2002) introduce the Uniform
The and indicators do not use the closest distance between
distribution indicator, based too on the search of niches of size σ ,
two points in the objective space. Consequently, they do not have
given by
the same drawbacks as the spacing metric. However, the δi, j dis-
tance captures holes in the Pareto front if this one is piecewise 1
UD(YN ; σ ) =
discontinuous. For these two indicators, a lower value is desirable. 1 + Dnc (YN , σ )
These two metrics are more adapted to continuous Pareto front ap-
proximations. where Dnc (YN , σ ) is the standard deviation of the number of niches
around all the points of the Pareto front approximation YN defined
Remark. The authors of Custódio et al. (2011) suggest two ways as
⎛ ⎞
to compute extreme points x0 and xN+1 . For benchmark tests, the
|YN |
|YN |
2
Pareto front is known and extreme points correspond to the ones 1 1
of the Pareto set. Otherwise, the and indicators use the ex- Dnc (YN , σ ) = ⎝ nc (y j , σ ) − nc (yl , σ ) ⎠
|YN | − 1 j=1
|YN | l=1
treme points of the Pareto set approximation XN .
406
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
with nc (y j , σ ) = {y ∈ YN , y − y j < σ } − 1. The UD indicator is to 2. For both sets, compute the average distances dl2 (YN1 ) and
be minimized.
dl2 (YN2 ) between neighbor points given by
Finally, the M3 metric defined by Zitzler et al. (20 0 0), considers
the extent of the front: 1
dl2 (YN ) = dl2 (y, YN \ {y} ).
m |YN | y∈Y
M3 (YN ) =
N
max max yj −y .
j∈{1,2,...,|YN |} y∈YN \{y j }
i=1 3. For each set, compute the spacing measures SP l (YN1 ) and
A higher value is considered to be better. The metric only takes M3 SP l (YN2 ) given by
into account the extremal points of the computed Pareto front ap-
2
proximation. Consequently, it is sufficient for two different algo-
rithms to have the same extremal points to be considered as equiv- 1 − ψ dl2 (y, YN \ {y} ), dl2 (YN )
SP l (YN ) =
alent according to this metric. It can be used on continuous and
y∈Y
|YN | − 1
discontinuous approximations of Pareto fronts as it only gives in- N
407
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
OS(YN ) = approximation such that |YN | > 2. The computation of this indica-
i=1
yi I − yi M tor is decomposed into several steps:
where yM is an approximation of the maximum objective vector 1. A minimum spanning tree TG covering all the elements of
(objective vector composed of the maxima of each single-objective YN based on the Euclidean distance in the objective space is
optimization problem assuming they exist) or the maximum ob- built.
jective vector and yI an approximation of the ideal objective vector 2. Each element y ∈ YN has at least one neighbor in the span-
or the ideal objective vector. ning set, i.e a vertex adjacent to y. Let NTG (y ) be the set of
This is an indicator for which the values are among the values 0 adjacent vertices to y in the spanning tree TG .
and 1. The maximum and ideal objective vectors need to be com- For each yv ∈ NTG (y ), we define a “neighborhood” (Li et al.,
puted for more precise results, resulting in 2 m single-objective 2008)
problems to solve. The indicator does not take into account the
Nyv (y ) = y2 ∈ YN , y2 − y ≤ yv − y
distribution of points along the Pareto front approximation.
which corresponds to the subset of YN contained in the
3.3.11. Outer diameter (IOD ) (Zitzler et al., 2008) closed ball of radius yv − y and centered in y. Note that
Analogously to the overall Pareto spread metric, the outer di- {y, yv } ∈ Nyv (y ). The neighborhoods that contain only two el-
ameter indicator returns the maximum distance along all objec- ements, i.e. y and yv , are not considered.
tive dimensions pondered by weights w ∈ Rm 3. For all y ∈ YN and yv ∈ NTG (y ), a distribution relation is de-
+ chosen by the user.
A higher indicator value is desirable. It is given by: fined by
⎧
⎨0 if |Nyv (y )| = 2,
IOD (YN ) = max wi max yi − min yi . y − y2
ψ (y, yv ) = otherwise.
1≤i≤m y∈YN y∈YN
⎩ y − yv
y2 ∈Nyv (y )\{y}
The weights can be used to impose an order on criteria im-
portance relatively to the modeling of a specific problem but it 4. There are 2|YN | − 2 neighborhoods. Among them, Nr corre-
is not mandatory. Although this indicator is cheap to compute, it sponds to the number of neighborhoods that only contain
only takes into account the extent of the Pareto front approxima- two elements. The uniform assessment metric is then de-
tion. By the way, it can result in an information loss of the extent fined by
of the Pareto front approximation, as it focuses only on the largest 1
distance along a single dimension. ID (YN ) = ψ (y, yv )
2|YN | − Nr − 2
y∈YN yv ∈NT (y )
G
3.3.12. Distribution metric (DM) (Zheng, Yang, Xu, & Hu, 2017) which corresponds to the mean of the distribution relation
This indicator introduced by Zheng et al. (2017) aims to cor- for neighborhoods containing more than two elements.
rect several drawbacks of the spacing measure (Schott, 1995) and
add some information about the extent of the Pareto front. As it is This indicator does not require external parameters. Due to the
mentioned, the “spacing metric does not adopt normalized distance, definition of the neighborhood, it takes into account holes in the
which may result in a bias conclusion, especially when the orders of Pareto front. Indeed, contrary to the spacing metric, it does not
magnitudes of the objectives differ considerably”. Moreover, it cannot consider only closest distances between objective vectors.
account for holes in the Pareto front, as it considers only closest
neighbors. An example pointing out the drawbacks of the spacing 3.3.14. Extension measure (EX) (Meng et al., 2005)
metric (3.3.1) is given in Fig. 3. This indicator aims to measure the extent of the Pareto front
The DM indicator is given by approximation. It is given by
m yI − yN
1 σi 1
m
DM (YN ) =
i i
EX (YN ; YP ) = d2 (yi, , YN )2
|YN | μii=1
max yi − min yi m
y∈YN y∈YN i=1
408
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
409
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
the nadir and ideal objective vectors is firstly normalized, When an element of the Pareto front approximation YN does
then divided into boxes, whose the number is decided by not belong to the set of nearest points to the Pareto optimal solu-
the user. Based on this discretization of the objective space, tion set YP , it is ignored by the IGD indicator. The authors of Tian,
the indicator is computed using the values of the density Zhang, Cheng, and Jin (2016) propose a variant of the IGD indicator,
function for each center of each box and the Shannon for- named IGD–NS, which takes into account these non-contributed el-
mula of entropy (Shannon, 2001). The higher the value, the ements in an Euclidean distance-based indicator. Let
better.
2. Cluster CLμ and Number of Distinct Choices NDCμ (Wu & YNC = y ∈ YN : ∀y ∈ YP , y − y
2 2
= min y − y 1 2
y1 ∈YN
Azarm, 20 0 0): Given two respective good (ideal objective
vector) and bad (maximum objective vector assuming it ex- be the set of non-contributed elements of YN . The IGD–NS indicator
ists) objective vectors yI and yM , the objective space (pre- is defined by
liminary normalized) is divided into hyperboxes of size μ
(∈ (0; 1]). NDCμ is defined as the number of hyperboxes IGD–NS(YN ; YP ) = min y1 − y2 + min y1 − y2 .
y1 ∈YN y1 ∈YNC
containing elements of the Pareto front approximation. For y2 ∈YP y2 ∈YP
this indicator, a higher value is desirable. CLμ is then defined A lower indicator value is desirable.
|YN |
as CLμ (YN ) = NDCμ (YN )
. A higher value of the CLUμ indicator
is the consequence of a more clustered distribution of the el- 3.4.2. Averaged Hausdorff distance ( p ) (Schutze, Esquivel, Lara, &
ements of a Pareto front approximation and so a lower value Coello, 2012)
is considered to be better. In Schutze et al. (2012), the authors combine GD (3.2.1) and
3. Sigma diversity metrics σ and σ (Mostaghim & Teich, 2005): IGD (3.4.1) into a new indicator, called the averaged Hausdorff dis-
The objective space is divided into zones delimited by uni- tance p defined by
formly distributed reference lines starting from the origin
whose the number equals |YN |. The indicator value is the ra- p (YN ; YP ) = max {GD p (YN ; YP ), IGD p (YN ; YP )}
tio of the number of objective vectors that are sufficiently where GD p and IGD p are slightly modified versions of the GD and
close to the reference lines according to the Euclidean norm IGD indicators defined as
with a threshold d chosen by the user, over the total number 1
p p
of reference lines. The higher the value, the better. 1
4. Diversity comparison indicator DCI (Li, Yang, & Liu, 2014): It GD p (YN ; YP ) = min y1 − y2
|YN | 1
y ∈YN
y2 ∈YP
is a k-ary spread indicator. The zone of interest in the objec-
tive space delimited by lower and upper bounds is divided and
into a number of hyperboxes. For each Pareto front approx- 1
imation, a contribution coefficient is computed relatively to p p
1
the hyperboxes where non-dominated points are found. For IGD p (YN ; YP ) = min y − y 1 2
.
each Pareto front approximation, DCI returns the mean of
| P| 2
Y
y ∈YP
y1 ∈YN
410
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
2015), one of their major drawbacks is that they are non mono- The authors of Hansen and Jaszkiewicz (1998) recommend to
tone (Ishibuchi et al., 2015). The p indicator (3.4.2) has the same use the utility set U∞ = (uw )w∈W of weighted Tchebycheff utility
problem (Schutze et al., 2012). functions, with
Also, the authors of Ishibuchi et al. (2015) propose a slightly
different version of the IGD indicator named IGD+ integrating the uw (y ) = − max (wi |yi − ri | )
i=1,2,...,m
dominance relation computable in O (m |YN | × |YP | ) where YP is a
fixed Pareto optimal solution set. It is weakly Pareto compliant, for y ∈ Rm where r is a reference objective vector chosen so that
i.e. : any objective vector of the feasible objective set does not dominate
r (or as an approximation of the ideal point Brockhoff, Wagner, &
for all YN1 , YN2 ∈ such that YN1 YN2 , IGD+ (YN1 ; YP ) ≤ IGD+ (YN2 ; YP ). Trautmann, 2012; Brockhoff, Wagner, & Trautmann, 2015; Zitzler
The IGD+ indicator is defined by et al., 2008) and w ∈ W a weight vector such that for all w ∈ W
and i = 1, 2, . . . , m,
1
IGD+ (YN ; YP ) = min (y1 − y2 )+
m
|YP | y2 ∈Y y1 ∈YN
wi ≥ 0 and wi = 1.
P
i=1
where ( y1 − y2 )
+ = (max(0, y1i − y2i ))i=1,2,...,m .
As opposed to the IGD indicator, IGD+ takes
into account the Zitzler et al. (2008) suggest using the set of augmented weighted
dominance relation between an element of YP and an element of Tchebycheff utility functions defined by
YN when computing their Euclidean distance. A reference set YR
can also be used instead of YP : the authors of Ishibuchi, Masuda,
m
uw ( y ) = − max wi |yi − ri | + ρ | ( yi − ri |
Tanigaki, and Nojima (2014) analyze the choice of such reference i=1,2,...,m
i=1
sets. This indicator can be used with discontinuous and continu-
ous Pareto fronts. where ρ is a sufficiently small positive real number.
Similarly to IGD+ , given a reference set YR ⊂ Rm , As given in Brockhoff et al. (2012), for m = 2 objectives, W can
Dist1R (Czyzzak & Jaszkiewicz, 1998) is given by be chosen such that:
1 1. W = {(0, 1 ), ( k−1
1 1
), ( k−1
2 2
), . . . , ( 1, 0 )}
Dist1R (YN ; YR ) = min max {0, wi (yi − ri )} , 1− k−1
, 1− k−1
is a
|YR | y∈YN i=1,2,...,m
r∈YR set of k weights uniformly distributed in the space [0; 1]2 .
tan ϕ
2. W = {( 1+tan
1
ϕ , 1+tan ϕ ), ϕ ∈ k } where k = {0, 2(kπ−1) ,
with wi a relative weight assigned to objective i. For all these indi-
2π
cators, a lower value is desirable. 2(k−1 )
, . . . , π2
} is a set of weights uniformly distributed over
the trigonometric circle.
3.4.4. R1 and R2 indicators (Hansen & Jaszkiewicz, 1998)
The IR2 indicator (Brockhoff et al., 2012) is an unary indicator
Let YN1 and YN2 be two Pareto front approximations, U a set of
derived from R2 defined as (in the case of weighted Tchebycheff
utility functions u : Rm → R mapping each point in the objective
utility functions)
space into a measure of utility, and p a probability distribution on
the set U. To each u ∈ U are associated u (YN1 ) = maxy∈Y 1 u(y ) and 1
& '
N IR2 (YN ; W ) = min max (wi |yi − ri | ) .
u (YN2 ) = maxy∈Y 2 u(y ). The two indicators measure to which ex- |W | y∈YN i=1,2,...,m
w∈W
N
tent YN1 is better than YN2 over the set of utility functions U. The R1 The lower this index, the better.
indicator is given by As Knowles and Corne (2002) remark, “the application of R2 de-
%
pends up on the assumption that it is meaningful to add the val-
R1 (YN1 , YN2 ; U, p) = C (YN1 , YN2 , u ) p(u ) du
u∈U
ues of different utility functions from the set U. This simply means
that each utility function in U must be appropriately scaled with
where
respect to the others and its relative importance”. R-indicators are
1 if u (YN1 ) > u (YN2 ), only monotonic, i.e. I (YN1 ) ≥ I (YN2 ) in case YN1 weakly dominates YN2 .
C( YN1 , YN2 , u )= 1/2 if u (YN1 ) = u (YN2 ), They do not require important computations as the number of
0 if u (YN1 ) < u (YN2 ). objectives increases. The reference point has to be chosen care-
fully. Studies concerning the properties of the R2 indicator can be
The R2 indicator defined as
found in Brockhoff et al. (2012), Brockhoff et al. (2015) and Wagner,
R2 (YN1 , YN2 ; U, p) = E u (YN1 ) − E u (YN2 ) Trautmann, and Brockhoff (2013).
% 1
= u (YN ) − u (YN2 ) p(u )du.
u∈U 3.4.5. G-metric (Lizarraga-Lizarraga, Hernandez-Aguirre, &
Botello-Rionda, 2008)
is the expected difference in the utility of a Pareto front approx-
This indicator enables to compare k Pareto front approximations
imation YN1 with another one YN2 . In practice, these two indicators
based on two criteria: the repartition of their elements and the
use a discrete and finite set U of utility functions associated with
level of domination in the objective space. It is compliant with the
an uniform distribution over U (Zitzler et al., 2008). The two indi-
weak dominance relation as defined above. Its computation de-
cators can then be rewritten as
composes into several steps: given k Pareto front approximations
1 (YN1 , YN2 , . . . , YNk ):
R1 (YN1 , YN2 ; U ) = C (YN1 , YN2 , u ) and
|U | u∈U
1. Scale the values of the objective vectors corresponding to
1 1
R2 (YN1 , YN2 ; U ) = u YN − u (YN2 ). the images of the decision vectors in the k sets, i.e. ex-
|U | u∈U tract the non-dominated objective vectors from the union
#k j
If R2 (YN1 , YN2 ; U ) > 0, then YN1 is considered as better than YN2 . Else if YN , then normalize all objective vectors according to
j=1
R2 (YN1 , YN2 ; U ) ≥ 0, YN1 is considered as not worse than YN2 . the extreme values of the objective vectors of this set.
411
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
412
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Fig. 6. The relative value of the hypervolume metric depends on the chosen reference point r 1 or r 2 (inspired by Knowles & Corne, 2002). On the top, two non-dominated
YN1 and YN2 sets are shown, with HV (YN1 ; r 1 ) > HV (YN2 ; r 1 ). On the bottom, HV (YN2 ; r 2 ) > HV (YN1 ; r 2 ).
The Hyperarea Difference was suggested by Wu and Azarm volume indicators based on set monotonic and desirability func-
(20 0 0) to compensate the lack of information about the theoret- tions (Emmerich, Deutz, & Yevseyeva, 2014).
ical Pareto front. Given a good objective vector
yI and a bad objec-
tive vector yM , we can approximate the size of the area dominated
by the Pareto front (or circumvent the objective space by a rect-
angle). The Hyperarea Difference is just the normalization of the
3.4.8. Hypervolume Sharpe-Ratio indicator IHSR (Yevseyeva, Guerreiro,
dominated space by the Pareto front approximation in the objec-
Emmerich, & Fonseca, 2014)
tive space over the given rectangle.
The conception of this indicator proposed by Yevseyeva et al.
More recently, a pondered hypervolume by weights indicator
(2014) lies on an analogy between the financial Portfolio Selection
was introduced by Zitzler et al. (2007) to give a preference of
problem (Markowitz, 1952) and the quality of a Pareto front ap-
an objective according to another. More volume indicators can be
proximation. This analogy interprets the elements of the approxi-
found in Wu and Azarm (20 0 0). Some other authors (Jiang, Yang,
mation set as assets with expected returns. An investor must al-
& Li, 2016) (for biobjective optimization problems) suggest to com-
locate capital to maximize the expected return of the assembled
pute the hypervolume defined by a reference point and the projec-
portfolio while minimizing the expected variance (associated to
tion of the points belonging to the Pareto front approximation on
risk).
the line delimited by the two extreme points. This measure en-
To solve the financial Portfolio Selection problem, it is com-
ables to better estimate the distribution of the points along the
mon to look for a strategy which maximizes the financial per-
Pareto front (in fact, it can be shown that for a linear Pareto front,
formance index known as reward-to-volatility ratio or Sharpe ra-
an uniform distribution of points maximizes the hypervolume in-
tio (Cornuéjols, Peña, & Tütüncü, 2018). Let A = {a1 , a2 , . . . , a|A| } be
dicator: see Auger, Bader, and Brockhoff (2010); Auger et al. (2009,
a non-empty set of assets, let r a ∈ R|A| be the expected return of
2012); Shukla, Doll, and Schmeck (2014) for more details about the
these assets and Q ∈ R|A|×|A| the covariance matrix of asset returns.
properties of the hypervolume indicator). A logarithmic version of
The Sharpe-Ratio maximization problem is defined as
the hypervolume indicator called the logarithmic hypervolume in-
dicator (Friedrich et al., 2011) is defined by
(ra )T z − r f
max hA ( z ) = "
# z∈[0,1]|A| zT Qz
log HV (YN ; r ) = λm [log y, log r] |A|
y∈YN
such that zi = 1
with the same notations as previously. Note that this indicator can i=1
413
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
414
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
it is done in Al-Dujaili and Suresh (2016) and Brockhoff et al. long to the Pareto front and maximize (or minimize) the perfor-
(2015) is not a pertinent choice due to their inability to cap- mance indicator for a given multiobjective problem. Their study
ture dominance relation. Instead, we suggest to use their weakly enables to understand bias of considered indicators and analyze
monotonic counterpart IGD+ (3.4.3) or DOA (3.2.7), that can be the behavior of bounded size indicator-based algorithms. The first
cheaper to compute than for example the hypervolume indica- ones were done for the hypervolume indicator and some of its
tor when the number of objectives is high. It is equally possible variants in the biobjective case (Auger et al., 2009; 2012; Em-
to build Pareto-compliant indicators by considering a combination merich et al., 2013b) then extended to more objectives in Auger,
of weakly Pareto compliant indicators with at least one strictly Bader, and Brockhoff (2010) and Shukla et al. (2014). Theoretical
Pareto compliant indicator as it is proposed in Falcón-Cardona results for the R2 indicator (Brockhoff et al., 2012), the p in-
et al. (2019). dicator (Rudolph, Schütze, Grimme, Domínguez-Medina, & Traut-
The attainment function (da Fonseca, Fonseca, & Hall, 2001) mann, 2016) or the Hypervolume Sharpe-Ratio indicator (Guerreiro
is another tool for the performance assessment of multiobjective & Fonseca, 2020) for the biobjective case exist too.
(stochastic) solvers. Assuming a multiobjective solver has produced
j
k Pareto front approximations YN for j = 1, 2, . . . , k on a given prob- 4.3. Stopping criteria of multiobjective algorithms
lem, the empirical attainment function α : Rm → [0, 1] is defined
as To generate a Pareto front approximation, two approaches are
currently considered. The first category, named as scalarization
1
k
α (y ) = 1{YNj {y}}. methods, consists in aggregating the objective functions and to
k solve a series of single-objective problems. Surveys about scalar-
j=1
ization algorithms can be found for example in Wiecek, Ehrgott,
For a given y ∈ Rm , the attainment function estimates the prob- and Engau (2016). The second class, designed as a posteriori ar-
ability that the multiobjective solver reaches (in term of domi- ticulations of preferences (Custódio et al., 2011) methods, aims at
nance in the objective space) the objective vector y. The interested obtaining the whole Pareto front without combining any objective
reader can refer to Brockhoff, Auger, Hansen, and Tušar (2017), function in a single-objective framework. Evolutionary algorithms,
da Fonseca et al. (2001), Fonseca, da Fonseca, and Paquete (2005), Bayesian optimization methods (Emmerich et al., 2016) or deter-
Fonseca, Guerreiro, López-Ibáñez, and Paquete (2011) and Zitzler ministic algorithms such as DMS (Custódio et al., 2011) belong to
et al. (2008) for additional information. this category.
For scalarization methods, under some assumptions, solutions
4.2. Embedding performance indicators in multiobjective optimization to single-objective problems can be proved to belong to the Pareto
algorithms front or a local one. So, defining stopping criteria results in choos-
ing the number of single-objective problems to solve via the choice
Performance indicators are able to quantify properties a good of parameters and a single-objective stopping criterion for each of
Pareto front approximation should possess. It is then logical to in- them. Stopping at a predetermined number of function evaluations
corporate them into multiobjective optimization methods. By op- is often used in the context of blackbox optimization (Audet et al.,
timizing directly the indicator, one can hope to obtain approxi- 2008). The use of performance indicators also is not relevant.
mations of the Pareto front satisfying demanding properties. For A posteriori methods consider a set of points in the objective
these last years, the evolutionary multiobjective community has space (a population) that is brought to head for the Pareto front
frequently adopted this approach. along iterations. Basically, a maximum number of evaluations is
For example, performance indicators such as R2 (Trautmann, still given as a stopping criterion but it remains crucial to give
Wagner, & Brockhoff, 2013) or HV (Beume, Naujoks, & Emmerich, an estimation to how far from a (local) Pareto front the approxi-
2007), I (Zitzler & Künzli, 2004), IGD+ (Falcón-Cardona et al., mation set is. For multiobjective Bayesian optimization (Emmerich
2019; Lopez & Coello, 2016), IGD–NS (Tian et al., 2016) are used in et al., 2016), the goal is to find at next iteration the point that max-
selection mechanisms in evolutionary algorithms. The reader is in- imizes the hyperarea difference between old non-dominated set of
vited to consult the recent survey (Falcón-Cardona & Coello, 2020) points and the new one. The performance indicator is directly em-
for more information on indicator-based multiobjective evolution- bedded into the algorithm and could be used as a stopping crite-
ary algorithms. Similarly, the -indicator (Custódio et al., 2011) en- rion. For evolutionary algorithms, surveys on stopping criteria for
ables to identify holes in the Pareto front approximation around multiobjective optimization can be found in Martí et al. (2016) and
which the algorithm can explore to improve diversity. In global Wagner, Trautmann, and Martí (2011). The approach is to measure
stochastic optimization, some methods integrate hypervolume in- the progression of the current population combining performance
dicator (Bradford, Schweidtmann, & Lapkin, 2018; Emmerich, Yang, indicators (hypervolume, MDR, etc.) and statistic tools (Kalman fil-
Deutz, Wang, & Fonseca, 2016; Feliot, Bect, & Vazquez, 2017) and ter Martí et al., 2016, χ 2 -variance test Wagner, Trautmann, and
its variants (Feliot, Bect, & Vazquez, 2019) or R2 (Deutz, Emmerich, Naujoks (2009), etc.) These last ones enable to detect a stationary
& Yang, 2019) to better explore the decision space. In Akhtar and state reached by the evolving population.
Shoemaker (2016), the authors use Radial Basis models and the hy- We believe that the use of monotonic performance indicators
pervolume indicator to identify next promising points to evaluate. or binary ones that capture the dominance property seems to be
In Al-Dujaili and Suresh (2018), the authors propose a multiobjec- the most efficient one in the years to come to follow the behavior
tive optimistic algorithm using the additive -indicator (3.2.6) and of population-based algorithms along iterations.
analyse its link with the weighted Tchebysheff approach.
The transformation of a multiobjective optimization problem 4.4. Distribution and spread
into a single-objective quality indicator based problem implies a
loss of information. Indeed, the choice of a specific performance The choice of spread and distribution indicators has only a
indicator reflects the personal preferences of the decision user. It sense when one wants to measure the distribution of points in the
is then important to understand the bias of this choice on the so- objective space, no matter how close from the Pareto front the ap-
lution set found. Given a performance indicator, the concept of op- proximated set is. Spread and distribution metrics can put forward
timal μ-distribution (Auger et al., 2009) refers to the study of the global properties (for example statistics on the distribution of the
optimal distributions of non-dominated points of size μ which be- points or extent of the front) or local properties such as the largest
415
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
distance between closest non-dominated points that can be used sidered until now as the most relevant. The hypervolume indica-
to conduct search such as indicator. Typically, the construction tor possesses good mathematical properties, it can capture domi-
of a distribution or spread indicator requires two steps. The first nance properties and distribution and does not require the knowl-
consists in defining a distance between two points in the objec- edge of the Pareto front. Empirical studies (Jiang et al., 2014; Ok-
tive space. Many distribution indicators in the literature use min- abe et al., 2003) have confirmed its efficiency compared to other
imum Euclidean or Manhattan distance between points such as performance indicators. That is why it has been deeply used in
the SP metric (3.3.1), the index (3.3.2), HRS (3.3.4), and so on. the evolutionary community (Riquelme et al., 2015). However, it
The DM (3.3.12) and -metric (3.3.3) indicators use a “sorting dis- has some limitations: the exponential cost as the number of ob-
tance”; ID (3.3.13) a “neighborhood distance” based on a spanning jectives increases and the choice of the reference point. To com-
tree, and so on. Once this is done, many of the existing distribu- pare algorithms, it can be replaced with other indicators capturing
tion indicators are built by using statistic tools on this distance: lower dominance relation such as dominance move, G-metric, bi-
mean ( (3.3.2), U measure (3.3.9), DM (3.3.12) for example), mean nary -indicator, Hypervolume Sharpe-Ratio indicator, modified in-
square (SP (3.3.1), Dnc (3.3.5)), and so on. verted generated distance or degree of approximation whose com-
To use a distribution or spread indicator, it should satisfy the putational cost is less important.
following properties: Future research can focus on the discovery of new performance
indicators that correct some drawbacks of the hypervolume indi-
1. The support of scaled functions, which enables to compare
cator but keeps its good properties, and the integration of perfor-
all objectives in an equivalent way (DM (3.3.12), OS (3.3.10),
mance indicators directly into algorithms for multiobjective opti-
IOD (3.3.11), (3.3.3), (3.3.3)).
mization.
2. For piecewise continuous or discontinuous Pareto front ap-
proximations, a good distribution indicator should not be
Acknowledgments
based on the distance between closest neighbors, as it can
hide some holes (Zheng et al., 2017). Some indicators pos-
This research was financed by Le Digabel’s NSERC discov-
sess this property such as DM (3.3.12), (3.3.3), (3.3.3),
ery grant RGPIN-2018-05286, and also by the NSERC CRD RD-
Es (3.3.16) or evenness indicators (3.3.7).
CPJ 490744-15 grant and an InnovÉÉ grant, both in collaboration
3. Distribution and spread performance indicators should not
with Hydro-Québec and Rio Tinto. We also would like to thank the
be based on external parameters, such as Zitzler’s metric
anonymous reviewers for their suggestions and comments which
M2 (3.3.5), UD (3.3.5), or entropy measure (3.3.18).
helped to greatly improve this work.
4. An easy interpretation: a value returned by an indicator has
to be “intuitive” to understand. For example, the binary uni-
formity (3.3.8) is extremely difficult to interpret and should Appendix A. A summary of performance indicators
not be used. This remark applies for all types of performance
indicators. Table 3 draws a summary of all indicators described in
Section 3. Most of complexity cost indications for computing in-
One could directly include spread control parameters in the
dicators are drawn from Jiang et al. (2014). YP ⊆ YP corresponds to
design of new algorithms. The Normal Boundary Intersection
the Pareto optimal solution set and YN is a Pareto front approxima-
method (Das & Dennis, Jr., 1998) controls the spread of a Pareto
tion returned by a given algorithm. The symbol “✗” indicates that
front approximation. This method is also used in the context of
the performance indicator does not satisfy the monotony property.
blackbox optimization (Audet, Savard, & Zghal, 2010).
The “-” symbol corresponds to binary indicators, for which mono-
tonicity has no meaning.
5. Discussion
In this work, we give a review of performance indicators for the Appendix B. Compatibility and completeness
quality of Pareto front approximations in multiobjective optimiza-
tion, as well as some usages of these indicators. Tables 4 and 5 summarize compatibility and completeness
The most important application of performance indicators is to properties.
allow comparison and analysis of results of different algorithms. In Only the strongest relationships are kept. Some of them are
this optic, among all these indicators, the hypervolume indicator drawn from Zitzler et al. (2003). All spread and distribution indica-
and its binary counterpart, the hyperarea difference can be con- tors are not compatible with Pareto front approximation relations.
416
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Table 3
A summary of performance indicators.
Category Performance indicators Sect. Symbol Parameters Comparison sets Computational complexity Monotone
Cardinality C-metric/Two sets Coverage Zitzler and Thiele (1998) 3.1.5 C None Binary indicator O (m |YN1 | × |YN2 | ) -
3.1 Error ratio Van Veldhuizen (1999) 3.1.4 ER None Pareto front YP Low ✗
Generational non dominated vector 3.1.3 GNVG None None Low ✗
generation Van Veldhuizen and Lamont (2000)
Generational non dominated vector generation 3.1.3 GNVGR None Pareto front YP Low ✗
ratio Van Veldhuizen and Lamont (2000)
Mutual domination rate Martí et al. (2016) 3.1.3 MDR None None Low ✗
Nondominated vector additional Van Veldhuizen and 3.1.3 NVA None None Low ✗
Lamont (2000)
Overall nondominated vector generation Van Veldhuizen 3.1.1 ONVG None None Low ✗
(1999)
Overall nondominated vector generation 3.1.2 ONVGR None Pareto front YP Low ✗
ratio Van Veldhuizen (1999)
Ratio of non-dominated points by the reference 3.1.5 C2R None Reference set YR O (m |YN | × |YR | ) ✗
set Hansen and Jaszkiewicz (1998)
Ratio of the reference points Hansen and Jaszkiewicz 3.1.4 C1R None Reference set YR O (m |YN | × |YR | ) ✗
(1998)
Convergence Degree of Approximation Dilettoso et al. (2017) 3.2.7 DOA None Pareto front YP O (m |YN | × |YP | ) Not
strictly
3.2 -family Zitzler et al. (2003) 3.2.6 I None Pareto front YP O (m |YN | × |YP | ) Not
strictly
Generational distance Van Veldhuizen (1999) 3.2.1 GD None Pareto front YP O (m |YN | × |YP | ) ✗
γ -metric Deb et al. (2000) 3.2.1 γ None Pareto front YP O (m |YN | × |YP | ) ✗
Maximum Pareto front error Van Veldhuizen (1999) 3.2.4 MPF E None Pareto front YP O (m |YN | × |YP | ) ✗
M1 -metric Zitzler et al. (2000) 3.2.1 M1 None Pareto front YP O (m |YN | × |YP | ) ✗
Progression metric Van Veldhuizen (1999) 3.2.5 - None None O (m |YN | ) ✗
Seven points average distance Schott (1995) 3.2.3 SPAD None Reference set YR O (m |YN | ) ✗
Standard deviation from the Generational 3.2.2 ST DGD None Pareto front YP O (m |YN | × |YP | ) ✗
distance Van Veldhuizen (1999)
Distribution Cluster Wu and Azarm (2000) 3.3.18 CLμ A parameter None High ✗
μ
and spread -index Deb et al. (2000) 3.3.2 None Pareto front YP O (m |YN |2 + m |YN | × |YP | ) ✗
3.3 -index Deb et al. (2000) 3.3.2 None None O (m |YN |2 ) ✗
spread metric Zhou et al. (2006) 3.3.2 None Pareto front YP O (m |YN |2 + m |YN | × |YP | ) ✗
Distribution metric Zheng et al. (2017) 3.3.12 DM None None O (m |YN |2 ) ✗
Diversity comparison indicator Li et al. (2014) 3.3.18 DCI A parameter k-ary indicator O m (k |YNmax | )2 ✗
div comparing
YN1 , YN2 , . . . , YNk
non-dominated
sets
Diversity indicator Cai et al. (2018) 3.3.15 DIR None Reference set YR O (m |YN | × |YR | ) ✗
Entropy metric Farhang-Mehr and Azarm (2004) 3.3.18 - A parameter None High ✗
grids
Evenness Messac and Mattson (2004) 3.3.7 ξ None None O (m |YN |2 ) ✗
Extension Meng et al. (2005) 3.3.14 EX None Pareto front YP O (m |YN | × |YP | ) ✗
-metric Custódio et al. (2011) 3.3.3 None None O (m |YN |2 ) ✗
Hole Relative Size Collette and Siarry (2005) 3.3.4 HRS None None O (m |YN |2 ) ✗
O (|YN | 3 poly log |YN | )
m
Laumanns metric Laumanns et al. (2000) 3.3.17 - None None ✗
Modified Diversity indicator Asafuddoula et al. (2015) 3.3.18 M –DI A parameter Reference set YR O (m |YN |2 × |YR | )
δ
M2 -metric Zitzler et al. (2000) 3.3.5 M2 Niche radius None O (m |YN |2 ) ✗
σ
M3 -metric
Zitzler et al. (2000) 3.3.5 M3 None None O (m |YN | ) 2
✗
Number of distinct choices Wu and Azarm (2000) 3.3.18 NDCμ A parameter None High ✗
μ
Outer diameter Zitzler et al. (2008) 3.3.11 IOD None None O (m |YN | ) ✗
Overall Pareto Spread Wu and Azarm (2000) 3.3.10 OS None y˜I and y˜M O (m |YN | ) ✗
Riesz S-energy Hardin and Saff (2004) 3.3.16 ES A parameter None O (m |YN |2 ) ✗
s
Sigma diversity metric Mostaghim and Teich (2005) 3.3.18 σ A parameter None High ✗
lines
Spacing Schott (1995) 3.3.1 SP None None O (m |YN |2 ) ✗
U-measure Leung and Wang (2003) 3.3.9 U None None O (m |YN |2 ) ✗
Uniform assessment metric Li et al. (2008) 3.3.13 ID None None O (m |YN |2 ) ✗
Uniform distribution Tan et al. (2002) 3.3.5 UD Niche radius None O (m |YN |2 ) ✗
σ
Uniformity Sayın (2000) 3.3.6 δ None None O (m |YN |2 ) ✗
Uniformity Meng et al. (2005) 3.3.8 - None Binary Quadratic ✗
(continued on next page)
417
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Table 3 (continued)
Category Performance indicators Sect. Symbol Parameters Comparison sets Computational complexity Monotone
Convergence Averaged Hausdorff distance Schutze et al. (2012) 3.4.2 q None Pareto front YP O (m |YN | × |YP | ) ✗
and
Angle γ and O (|YN | poly log |YN | )
m
distribution Cone-based hypervolume Emmerich et al. (2013b) 3.4.7 - None 3 Strictly
Reference
point r
3.4 Dominance move Li and Yao (2017) 3.4.6 DoM None Binary indicator O (|YN | log |YN | ) -
O (|YN | 3 poly log |YN | )
m
D-metric/Difference coverage of two sets Zitzler (1999) 3.4.7 - Reference Binary indicator -
point r
DR -metric Czyzzak and Jaszkiewicz (1998) 3.4.3 - None Reference set YR O (m |YN | × |YR | ) Not
strictly
O (|YN | 3 poly log |YN | )
m
Hyperarea difference Wu and Azarm (2000) 3.4.7 HD Reference None Strictly
point r
O (|YN | 3 poly log |YN | )
m
Hypervolume indicator (or S-metric) Zitzler et al. (2000) 3.4.7 HV Reference None Strictly
point r
Hypervolume Sharpe-ratio indicator Yevseyeva et al. 3.4.8 IHSR None Polynomial Not
(2014) Reference points strictly
yl and yu
Inverted generational distance Coello and Cortés (2005) 3.4.1 IGD None Pareto front YP O (m |YN | × |YP | ) ✗
Inverted generation distance with non contributed 3.4.1 IGD–NS None Pareto front YP O (m |YN | × |YP | ) ✗
solutions detection Tian et al. (2016)
G-metric Lizarraga-Lizarraga et al. (2008) 3.4.5 - None k-ary indicator O (k3 |YNmax |2 ) Not
comparing strictly
YN1 , YN2 , . . . , YNk
non-dominated
sets
O (|YN | 3 poly log |YN | )
m
Logarithmic hypervolume indicator Friedrich et al. (2011) 3.4.7 log HV Reference None Strictly
point r
Modified inverted generational distance Ishibuchi et al. 3.4.3 IGD +
None Pareto front YP O (m |YN | × |YP | ) Not
(2015) strictly
Performance comparison indicator Li et al. (2015) 3.4.6 PCI σ distance k-ary indicator Quadratic Not
comparing strictly
YN1 , YN2 , . . . , YNk
non-dominated
sets
p, q-averaged distance Vargas and Bogoya (2018) 3.4.2 p,q None Pareto front YP Quadratic ✗
R-metric Hansen and Jaszkiewicz (1998) 3.4.4 YR A set W of Reference set YR O (m |YN | × |YR | × |W | ) Not
weights strictly
vectors
Table 4
Compatibility and completeness of unary performance indicators.
Cardinality Error ratio Van Veldhuizen (1999) 3.1.4 ER ER(YN1 ) < ER(YN2 ) ✗ ✗
3.1 Generational non dominated vector 3.1.3 GNVG - - -
generation Van Veldhuizen and Lamont (2000)
Generational non dominated vector generation 3.1.3 GNVGR - - -
ratio Van Veldhuizen and Lamont (2000)
Mutual domination rate Martí et al. (2016) 3.1.6 MDR - - -
Nondominated vector additional Van Veldhuizen 3.1.3 NVA - - -
and Lamont (2000)
Overall nondominated vector 3.1.1 ONVG ONVG(YN1 ) > ONVG(YN2 ) ✗ ✗
generation Van Veldhuizen (1999)
Overall nondominated vector generation 3.1.2 ONVGR ONVGR(YN1 ; YP ) > ONVGR(YN2 ; YP ) ✗ ✗
ratio Van Veldhuizen (1999)
Ratio of non-dominated points by the reference 3.1.5 C2R (
C2R YN1 ; YR )> (
C2R YN2 ; YR ) ✗ ✗
set Hansen and Jaszkiewicz (1998)
Ratio of the reference points Hansen and 3.1.4 C1R C1R (YN1 ; YR ) > C1R (YN2 ; YR ) ✗ ✗
Jaszkiewicz (1998)
Convergence Degree of Approximation Dilettoso et al. (2017) 3.2.7 DOA DOA(YN1 ; YP ) < DOA(YN2 ; YP ) Not better than ≺
3.2 Generational distance Van Veldhuizen (1999) 3.2.1 GD GD (YN1 ; YP ) < GD (YN2 ; YP ) ✗ ✗
γ -metric Deb et al. (2000) 3.2.1 γ γ (YN1 ; YP ) < γ (YN2 ; YP ) ✗ ✗
Maximum Pareto front error Van Veldhuizen 3.2.4 MPF E MPF E (YN1 ; YP ) < MPF E (YN2 ; YP ) ✗ ✗
(1999)
M1 -metric Zitzler et al. (2000) 3.2.1 M1 M1 (YN1 ; YP ) < M1 (YN2 ; YP ) ✗ ✗
Progression metric Van Veldhuizen (1999) 3.2.5 - - - -
Seven points average distance Schott (1995) 3.2.3 SPAD SPAD (YN1 ; YP ) < SPAD (YN2 ; YP ) ✗ ✗
Standard deviation from the Generational 3.2.2 ST DGD - - -
distance Van Veldhuizen (1999)
Distribution Cluster Wu and Azarm (2000) 3.3.18 CLμ - - -
and spread -index Deb et al. (2000) 3.3.2 (YN1 ; YP ) < (YN2 ; YP ) ✗ ✗
(continued on next page)
418
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
Table 4 (continued)
Table 5
Compatibility and completeness of binary performance indicators (inspired by Zitzler et al. (2003)): a - means there is no comparison method which is complete and
compatible for the given relation, a ✗ that the indicator is not even monotone.
=
419
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
References Coello, C., & Cortés, N. (2005). Solving multiobjective optimization problems using
an artificial immune system. Genetic Programming and Evolvable Machines, 6(2),
Akhtar, T., & Shoemaker, C. (2016). Multi objective optimization of computation- 163–190. https://doi.org/10.1007/s10710- 005- 6164- x.
ally expensive multi-modal functions with RBF surrogates and multi-rule se- Collette, Y., & Siarry, P. (2005). Three new metrics to measure the convergence of
lection. Journal of Global Optimization, 64(1), 17–32. https://doi.org/10.1007/ metaheuristics towards the Pareto frontier and the aesthetic of a set of solutions
s10898-015-0270-y. in biobjective optimization. Computers and Operations Research, 32(4), 773–792.
Al-Dujaili, A., & Suresh, S. (2016). BMOBench: Black-box multi-objective optimiza- https://doi.org/10.1016/j.cor.2003.08.017.
tion benchmarking platform. Technical Report. arXiv. Collette, Y., & Siarry, P. (2011). Multiobjective optimization: Principles and case studies.
Al-Dujaili, A., & Suresh, S. (2018). Revisiting norm optimization for multi-objective Springer. https://doi.org/10.1007/978- 3- 662- 08883- 8.
black-box problems: A finite-time analysis. Journal of Global Optimization, 73(3), Cornuéjols, G., Peña, J., & Tütüncü, R. (2018). Optimization methods in finance. Cam-
659–673. https://doi.org/10.1007/s10898- 018- 0709- z. bridge University Press. https://doi.org/10.1017/9781107297340.
Asafuddoula, M., Ray, T., & Singh, H. (2015). Characterizing Pareto front approxima- Custódio, A., Emmerich, M., & Madeira, J. (2012). Recent developments in derivative-
tions in many-objective optimization. In Proceedings of the 2015 annual confer- free multiobjective optimization. Computational Technology Reviews, 5(1), 1–30.
ence on genetic and evolutionary computation (pp. 607–614). New York, NY, USA: https://doi.org/10.4203/ctr.5.1.
ACM. https://doi.org/10.1145/2739480.2754701. Custódio, A., Madeira, J., Vaz, A., & Vicente, L. (2011). Direct multisearch for mul-
Audet, C., Savard, G., & Zghal, W. (2008). Multiobjective optimization through a se- tiobjective optimization. SIAM Journal on Optimization, 21(3), 1109–1140. https:
ries of single-objective formulations. SIAM Journal on Optimization, 19(1), 188– //doi.org/10.1137/10079731X.
210. https://doi.org/10.1137/060677513. Czyzzak, P., & Jaszkiewicz, A. (1998). Pareto simulated annealing–a meta-
Audet, C., Savard, G., & Zghal, W. (2010). A mesh adaptive direct search algorithm heuristic technique for multiple-objective combinatorial optimization. Jour-
for multiobjective optimization. European Journal of Operational Research, 204(3), nal of Multi-Criteria Decision Analysis, 7(1), 34–47. https://doi.org/10.1002/(sici)
545–556. https://doi.org/10.1016/j.ejor.2009.11.010. 1099- 1360(199801)7:1<34::aid- mcda1613.0.co;2- 6.
Auger, A., Bader, J., & Brockhoff, D. (2010). Theoretically investigating optimal μ- Dächert, K., Klamroth, K., Lacour, R., & Vanderpooten, D. (2017). Efficient compu-
distributions for the hypervolume indicator: First results for three objectives. In tation of the search region in multi-objective optimization. European Journal of
R. Schaefer, C. Cotta, J. Kołodziej, & G. Rudolph (Eds.), Parallel problem solving Operational Research, 260(3), 841–855. https://doi.org/10.1016/j.ejor.2016.05.029.
from nature, PPSN XI (pp. 586–596). Berlin, Heidelberg: Springer. https://doi.org/ Das, I., & DennisJr. , J. (1998). Normal-boundary intersection: A new method for
10.1007/978- 3- 642- 15844- 5_59. generating the Pareto surface in nonlinear multicriteria optimization prob-
Auger, A., Bader, J., Brockhoff, D., & Zitzler, E. (2009). Theory of the hypervolume lems. SIAM Journal on Optimization, 8(3), 631–657. https://doi.org/10.1137/
indicator: Optimal μ-distributions and the choice of the reference point. In S1052623496307510.
Proceedings of the tenth ACM SIGEVO workshop on foundations of genetic algo- Deb, K. (2001). Multi-objective optimization using evolutionary algorithms. New York,
rithms (pp. 87–102). New York, NY, USA: ACM. https://doi.org/10.1145/1527125. NY, USA: John Wiley and Sons.
1527138. Deb, K., Agrawal, S., Pratap, A., & Meyarivan, T. (20 0 0). A fast elitist non-
Auger, A., Bader, J., Brockhoff, D., & Zitzler, E. (2012). Hypervolume-based multiob- dominated sorting genetic algorithm for multi-objective optimization: NSGA-
jective optimization: Theoretical foundations and practical implications. Theo- II. In M. Schoenauer, K. Deb, G. Rudolph, X. Yao, E. Lutton, J. Merelo, & H.-
retical Computer Science, 425, 75–103. https://doi.org/10.1016/j.tcs.2011.03.012. P. H. P. Schwefel (Eds.), Parallel problem solving from nature PPSN VI (pp. 849–
Bäck, T. (1996). Evolutionary algorithms in theory and practice: Evolution strategies, 858). Berlin, Heidelberg: Springer. https://doi.org/10.1007/3- 540- 45356- 3_83.
evolutionary programming, genetic algorithms. New York, NY, USA: Oxford Uni- Deb, K., & Tiwari, S. (2008). Omni-optimizer: A generic evolutionary algorithm
versity Press. for single and multi-objective optimization. European Journal of Operational Re-
Beiranvand, V., Hare, W., & Lucet, Y. (2017). Best practices for comparing optimiza- search, 185(3), 1062–1087. https://doi.org/10.1016/j.ejor.2006.06.042.
tion algorithms. Optimization and Engineering, 18(4), 815–848. https://doi.org/10. Deutz, A., Emmerich, M., & Yang, K. (2019). The expected R2-indicator improvement
1007/s11081- 017- 9366- 1. for multi-objective Bayesian optimization. In K. Deb, E. Goodman, C. Coello,
Beume, N., Fonseca, C., Lopez-Ibanez, M., Paquete, L., & Vahrenhold, J. (2009). On K. Klamroth, K. Miettinen, S. Mostaghim, & P. Reed (Eds.), Evolutionary multi-
the complexity of computing the hypervolume indicator. IEEE Transactions on criterion optimization (pp. 359–370). Cham: Springer. https://doi.org/10.1007/
Evolutionary Computation, 13(5), 1075–1082. https://doi.org/10.1109/tevc.2009. 978- 3- 030- 12598- 1_29.
2015575. Dilettoso, E., Rizzo, S., & Salerno, N. (2017). A weakly Pareto compliant quality in-
Beume, N., Naujoks, B., & Emmerich, M. (2007). SMS-EMOA: Multiobjective selec- dicator. Mathematical and Computational Applications, 22(1). https://doi.org/10.
tion based on dominated hypervolume. European Journal of Operational Research, 3390/mca22010025.
181(3), 1653–1669. https://doi.org/10.1016/j.ejor.20 06.08.0 08. Dolan, E., & Moré, J. (2002). Benchmarking optimization software with perfor-
Bradford, E., Schweidtmann, A., & Lapkin, A. (2018). Efficient multiobjective opti- mance profiles. Mathematical Programming, 91(2), 201–213. https://doi.org/10.
mization employing Gaussian processes, spectral sampling and a genetic al- 1007/s101070100263.
gorithm. Journal of Global Optimization, 71(2), 407–438. https://doi.org/10.1007/ Ehrgott, M. (2005). Multicriteria optimization, volume 491 of lecture notes in eco-
s10898- 018- 0609- 2. nomics and mathematical systems (2nd). Berlin: Springer.
Branke, J., Deb, K., Miettinen, K., & Slowiński, R. (2008). Multiobjective optimization: Emmerich, M., Deutz, A., & Kruisselbrink, J. (2013a). On Quality Indicators for Black-
Interactive and evolutionary approaches: 5252. Springer. https://doi.org/10.1007/ -Box Level Set Approximation (pp. 157–185)). Berlin, Heidelberg: Springer.
978- 3- 540- 88908- 3. Emmerich, M., Deutz, A., Kruisselbrink, J., & Shukla, P. (2013b). Cone-based hy-
Bringmann, K., & Friedrich, T. (2010a). Approximating the volume of unions and pervolume indicators: Construction, properties, and efficient computation. In
intersections of high-dimensional geometric objects. Computational Geometry, R. Purshouse, P. Fleming, C. Fonseca, S. Greco, & J. Shaw (Eds.), Evolutionary
43(6), 601–610. https://doi.org/10.1016/j.comgeo.2010.03.004. multi-criterion optimization (pp. 111–127). Berlin, Heidelberg: Springer. https:
Bringmann, K., & Friedrich, T. (2010b). Tight bounds for the approximation ratio of //doi.org/10.1007/978- 3- 642- 37140- 0_12.
the hypervolume indicator. In R. Schaefer, C. Cotta, J. Kołodziej, & G. Rudolph Emmerich, M., Deutz, A., & Yevseyeva, I. (2014). On Reference Point Free Weighted
(Eds.), Parallel problem solving from nature, PPSN XI (pp. 607–616). Berlin, Hei- Hypervolume Indicators based on Desirability Functions and their Probabilistic
delberg: Springer. Interpretation. Procedia Technology, 16, 532–541. https://doi.org/10.1016/j.protcy.
Bringmann, K., & Friedrich, T. (2013). Approximation quality of the hypervolume in- 2014.10.001.
dicator. Artificial Intelligence, 195, 265–290. https://doi.org/10.1016/j.artint.2012. Emmerich, M., Yang, K., Deutz, A., Wang, H., & Fonseca, C. (2016). A Multicriteria
09.005. Generalization of Bayesian Global Optimization (pp. 229–242)). Cham: Springer.
Brockhoff, D., Auger, A., Hansen, N., & Tušar, T. (2017). Quantitative performance Falcón-Cardona, J., & Coello, C. (2020). Indicator-based multi-objective evolutionary
assessment of multiobjective optimizers: The average runtime attainment func- algorithms: A comprehensive survey. ACM Computing Surveys, 53(2), 1–35. https:
tion. In H. Trautmann, G. Rudolph, K. Klamroth, O. Schütze, M. Wiecek, Y. Jin, & //doi.org/10.1145/3376916.
C. Grimme (Eds.), Evolutionary multi-criterion optimization (pp. 103–119). Cham: Falcón-Cardona, J., Coello, C., & Emmerich, M. (2019). CRI-EMOA: A Pareto-front
Springer. https://doi.org/10.1007/978- 3- 319- 54157- 0_8. shape invariant evolutionary multi-objective algorithm. In K. Deb, E. Goodman,
Brockhoff, D., Wagner, T., & Trautmann, H. (2012). On the properties of the R2 in- C. Coello, K. Klamroth, K. Miettinen, S. Mostaghim, & P. Reed (Eds.), Evolutionary
dicator. In Proceedings of the 14th annual conference on genetic and evolutionary multi-criterion optimization (pp. 307–318). Cham: Springer.
computation (pp. 465–472). New York, NY, USA: ACM. https://doi.org/10.1145/ Falcón-Cardona, J., Emmerich, M., & Coello, C. (2019). On the Construction of Pareto-
2330163.2330230. Compliant Quality Indicators. In Proceedings of the Genetic and Evolutionary Com-
Brockhoff, D., Wagner, T., & Trautmann, H. (2015). 2 indicator-based multiobjective putation Conference Companion (pp. 2024–2027). ACM. https://doi.org/10.1145/
search. Evolutionary Computation, 23(3), 369–395. https://doi.org/10.1162/EVCO_ 3319619.3326902.
a_00135. Farhang-Mehr, A., & Azarm, S. (2004). An information-theoretic entropy metric for
Cai, X., Sun, H., & Fan, Z. (2018). A diversity indicator based on reference vectors assessing multi-objective optimization solution set quality. Journal of Mechanical
for many-objective optimization. Information Sciences, 430–431, 467–486. https: Design, 125(4), 655–663. https://doi.org/10.1115/1.1623186.
//doi.org/10.1016/j.ins.2017.11.051. Faulkenberg, S., & Wiecek, M. (2009). On the quality of discrete representations in
Chan, T. (2013). Klee’s measure problem made easy. In IEEE 54th annual symposium multiple objective programming. Optimization and Engineering, 11(3), 423–440.
on foundations of computer science (pp. 410–419). IEEE. https://doi.org/10.1109/ https://doi.org/10.1007/s11081- 009- 9099- x.
FOCS.2013.51. Feliot, P., Bect, J., & Vazquez, E. (2017). A Bayesian approach to constrained single-
Cheng, S., Shi, Y., & Qin, Q. (2012). On the performance metrics of multiob- and multi-objective optimization. Journal of Global Optimization, 67(1–2), 97–
jective optimization. In Y. Tan, Y. Shi, & Z. Ji (Eds.), Advances in swarm in- 133. https://doi.org/10.1007/s10898- 016- 0427- 3.
telligence (pp. 504–512). Berlin, Heidelberg: Springer. https://doi.org/10.1007/ Feliot, P., Bect, J., & Vazquez, E. (2019). User preferences in Bayesian multi-objective
978- 3- 642- 30976- 2_61. optimization: The expected weighted hypervolume improvement criterion. In
420
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
G. Nicosia, P. Pardalos, G. Giuffrida, R. Umeton, & V. Sciacca (Eds.), Machine Li, M., & Yao, X. (2019). Quality evaluation of solution sets in multiobjective optimi-
learning, optimization, and data science (pp. 533–544). Cham: Springer. https: sation: A survey. ACM Computing Surveys, 52(2), 26:1–26:38. https://doi.org/10.
//doi.org/10.1007/978- 3- 030- 13709- 0_45. 1145/3300148.
Fonseca, C., da Fonseca, V., & Paquete, L. (2005). Exploring the performance of Li, M., Zheng, J., & Xiao, G. (2008). Uniformity assessment for evolutionary multi-
stochastic multiobjective optimisers with the second-order attainment function. objective optimization. In IEEE congress on evolutionary computation (IEEE World
In C. Coello, A. H. Aguirre, & E. Zitzler (Eds.), Evolutionary multi-criterion op- Congress on Computational Intelligence) (pp. 625–632). https://doi.org/10.1109/
timization (pp. 250–264). Berlin, Heidelberg: Springer. https://doi.org/10.1007/ CEC.2008.4630861.
978- 3- 540- 31880- 4_18. Liuzzi, G., Lucidi, S., & Rinaldi, F. (2016). A derivative-free approach to constrained
Fonseca, C., Guerreiro, A., López-Ibáñez, M., & Paquete, L. (2011). On the computa- multiobjective nonsmooth optimization. SIAM Journal on Optimization, 26(4),
tion of the empirical attainment function. In R. Takahashi, K. Deb, E. Wanner, 2744–2774. https://doi.org/10.1137/15M1037810.
& S. Greco (Eds.), Evolutionary multi-criterion optimization (pp. 106–120). Berlin, Lizarraga-Lizarraga, G., Hernandez-Aguirre, A., & Botello-Rionda, S. (2008). G-metric:
Heidelberg: Springer. https://doi.org/10.1007/978- 3- 642- 19893- 9_8. An M-ary quality indicator for the evaluation of non-dominated sets. In Pro-
da Fonseca, V., Fonseca, C., & Hall, A. (2001). Inferential performance assess- ceedings of the 10th annual conference on genetic and evolutionary computa-
ment of stochastic optimisers and the attainment function. In E. Zitzler, tion (pp. 665–672). New York, NY, USA: ACM. https://doi.org/10.1145/1389095.
L. Thiele, K. C. A. C. Deb, & D. Corne (Eds.), Evolutionary multi-criterion op- 1389227.
timization (pp. 213–225). Berlin, Heidelberg: Springer. https://doi.org/10.1007/ Lopez, E., & Coello, C. (2016). IGD+-EMOA: A multi-objective evolutionary algo-
3- 540- 44719- 9_15. rithm based on IGD+. In 2016 IEEE Congress on Evolutionary Computation (CEC)
Friedrich, T., Bringmann, K., Voß, T., & Igel, C. (2011). The logarithmic hypervolume (pp. 999–1006). https://doi.org/10.1109/CEC.2016.7743898.
indicator. In Proceedings of the 11th workshop proceedings on foundations of ge- Markowitz, H. (1952). Portfolio selection. Journal of Finance, 7(1), 77–91.
netic algorithms (pp. 81–92). New York, NY, USA: ACM. https://doi.org/10.1145/ Martí, L., García, J., Berlanga, A., & Molina, J. (2016). A stopping criterion for multi-
1967654.1967662. objective optimization evolutionary algorithms. Information Sciences, 367–368,
Ghosh, D., & Chakraborty, D. (2015). A direction based classical method to obtain 700–718. https://doi.org/10.1016/j.ins.2016.07.025.
complete Pareto set of multi-criteria optimization problems. Opsearch, 52(2), Meng, H., Zhang, X., & Liu, S. (2005). New quality measures for multiobjective
340–366. https://doi.org/10.1007/s12597- 014- 0178- 1. programming. In L. Wang, K. Chen, & Y. Ong (Eds.), Advances in natural com-
Guerreiro, A., & Fonseca, C. (2016). Hypervolume sharpe-ratio indicator: Formaliza- putation (pp. 1044–1048). Berlin, Heidelberg: Springer. https://doi.org/10.1007/
tion and first theoretical results. In Parallel problem solving from nature – PPSN 11539117_143.
XIV (pp. 814–823). https://doi.org/10.1007/978- 3- 319- 45823- 6_76. Messac, A., & Mattson, C. (2004). Normal constraint method with guarantee of
Guerreiro, A., & Fonseca, C. (2020). An analysis of the hypervolume sharpe-ratio even representation of complete Pareto frontier. AIAA Journal, 42(10), 2101–2111.
indicator. European Journal of Operational Research, 283(2), 614–629. https://doi. https://doi.org/10.2514/1.8977.
org/10.1016/j.ejor.2019.11.023. Moré, J., & Wild, S. (2009). Benchmarking derivative-free optimization algorithms.
Hansen, M., & Jaszkiewicz, A. (1998). Evaluating the quality of approximations to the SIAM Journal on Optimization, 20(1), 172–191. https://doi.org/10.1137/080724083.
non-dominated set. IMM, Department of Mathematical Modelling. Mostaghim, S., & Teich, J. (2005). A new approach on many objective diversity
Hardin, D., & Saff, E. (2004). Discretizing manifolds via minimum energy points. measurement. In J. Branke, K. Deb, K. Miettinen, & R. Steuer (Eds.), Practi-
Notices of the AMS, 51(10), 1186–1194. cal approaches to multi-objective optimization. In Dagstuhl Seminar Proceedings.
Hardin, D., & Saff, E. (2005). Minimal Riesz energy point configurations for rectifi- Dagstuhl, Germany: Internationales Begegnungs- und Forschungszentrum für
able d-dimensional manifolds. Advances in Mathematics, 193(1), 174–204. https: Informatik (IBFI), Schloss Dagstuhl, Germany.
//doi.org/10.1016/j.aim.20 04.05.0 06. Nesterov, Y., & Nemirovskii, A. (1994). Interior-point polynomial algorithms in convex
Ishibuchi, H., Imada, R., Setoguchi, Y., & Nojima, Y. (2018). How to specify a reference programming. Society for Industrial and Applied Mathematics. https://doi.org/10.
point in hypervolume calculation for fair performance comparison. Evolutionary 1137/1.9781611970791.
Computation, 26(3), 411–440. https://doi.org/10.1162/evco_a_00226. Nocedal, J., & Wright, S. (2006). Numerical optimization. Springer Series in Operations
Ishibuchi, H., Masuda, H., Tanigaki, Y., & Nojima, Y. (2014). Difficulties in specify- Research and Financial Engineering (2nd). Berlin: Springer.
ing reference points to calculate the inverted generational distance for many- Okabe, T., Jin, Y., & Sendhoff, B. (2003). A critical survey of performance indices
objective optimization problems. In IEEE symposium on computational intel- for multi-objective optimisation. In Evolutionary computation: 2 (pp. 878–885).
ligence in multi-criteria decision-making (pp. 170–177). https://doi.org/10.1109/ https://doi.org/10.1109/CEC.2003.1299759. Canberra, Australia
mcdm.2014.7007204. Riquelme, N., Lücken, C. V., & Baran, B. (2015). Performance metrics in multi-
Ishibuchi, H., Masuda, H., Tanigaki, Y., & Nojima, Y. (2015). Modified distance calcu- objective optimization. In Latin American computing conference (pp. 1–11). IEEE.
lation in generational distance and inverted generational distance. In A. Gaspar- https://doi.org/10.1109/CLEI.2015.7360024.
Cunha, C. H. Antunes, & C. Coello (Eds.), Evolutionary multi-criterion optimization Rudolph, G., Schütze, O., Grimme, C., Domínguez-Medina, C., & Traut-
(pp. 110–125). Cham: Springer. https://doi.org/10.1007/978- 3- 319- 15892- 1_8. mann, H. (2016). Optimal averaged Hausdorff archives for bi-objective problems:
Jaszkiewicz, A. (2018). Improved quick hypervolume algorithm. Computers and Op- theoretical and numerical results. Computational Optimization and Applications,
erations Research, 90, 72–83. https://doi.org/10.1016/j.cor.2017.09.016. 64(2), 589–618. https://doi.org/10.1007/s10589-015-9815-8.
Jiang, S., Ong, Y., Zhang, J., & Feng, L. (2014). Consistencies and contradictions of Russo, L., & Francisco, A. (2014). Quick hypervolume. IEEE Transactions on Evolution-
performance metrics in multiobjective optimization. IEEE Transactions on Cyber- ary Computation, 18(4), 481–502. https://doi.org/10.1109/TEVC.2013.2281525.
netics, 44(12), 2391–2404. https://doi.org/10.1109/tcyb.2014.2307319. Russo, L., & Francisco, A. (2016). Extending quick hypervolume. Journal of Heuristics,
Jiang, S., Yang, S., & Li, M. (2016). On the use of hypervolume for diversity measure- 22(3), 245–271. https://doi.org/10.1007/s10732- 016- 9309- 6.
ment of Pareto front approximations. In 2016 IEEE Symposium Series on Com- Sayın, S. (20 0 0). Measuring the quality of discrete representations of efficient sets
putational Intelligence (SSCI) (pp. 1–8). IEEE. https://doi.org/10.1109/SSCI.2016. in multiple objective mathematical programming. Mathematical Programming,
7850225. 87(3), 543–560. https://doi.org/10.1007/s101070 050 011.
Kidd, M., Lusby, R., & Larsen, J. (2020). Equidistant representations: Connecting cov- Schott, J. (1995). Fault tolerant design using single and multicriteria genetic algo-
erage and uniformity in discrete biobjective optimization. Computers and Oper- rithm optimization. Technical Report. Air force Institute Of Tech Wright-Patter-
ations Research, 117, 104872. https://doi.org/10.1016/j.cor.2019.104872. son AFB OH.
Knowles, J., & Corne, D. (2002). On metrics for comparing nondominated sets. In Schutze, O., Esquivel, X., Lara, A., & Coello, C. (2012). Using the averaged Hausdorff
Evolutionary computation: 1 (pp. 711–716). Honolulu, HI: IEEE. https://doi.org/ distance as a performance measure in evolutionary multiobjective optimization.
10.1109/CEC.2002.1007013. IEEE Transactions on Evolutionary Computation, 16(4), 504–522. https://doi.org/
Lacour, R., Klamroth, K., & Fonseca, C. (2017). A box decomposition algorithm to 10.1109/TEVC.2011.2161872.
compute the hyperVolume indicator. Computers and Operations Research, 79, Shannon, C. (2001). A mathematical theory of communication. ACM SIGMOBILE Mo-
347–360. https://doi.org/10.1016/j.cor.2016.06.021. bile Computing and Communications Review, 5(1), 3–55. https://doi.org/10.1145/
Laumanns, M., Günter, R., & Schwefel, H. (1999). Approximating the Pareto set: Con- 584091.584093.
cepts, diversity issues, and performance assessment. Secretary of the SFB 531. Shir, O., Preuss, M., Naujoks, B., & Emmerich, M. (2009). Enhancing decision space
Laumanns, M., Zitzler, E., & Thiele, L. (20 0 0). A unified model for multi-objective diversity in evolutionary multiobjective algorithms. In M. Ehrgott, C. Fonseca,
evolutionary algorithms with elitism. In Congress on evolutionary computation: X. Gandibleux, J.-K. Hao, & M. Sevaux (Eds.), Evolutionary multi-criterion op-
1 (pp. 46–53). https://doi.org/10.1109/CEC.20 0 0.870274. Piscataway, New Jersey, timization (pp. 95–109). Berlin, Heidelberg: Springer. https://doi.org/10.1007/
USA 978- 3- 642- 01020- 0_12.
Leung, Y., & Wang, Y. (2003). U-measure: a quality measure for multiobjective pro- Shukla, P., Doll, N., & Schmeck, H. (2014). A theoretical analysis of volume based
gramming. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems Pareto front approximations. In Proceedings of the 2014 annual conference on ge-
and Humans, 33(3), 337–343. https://doi.org/10.1109/TSMCA.2003.817059. netic and evolutionary computation (pp. 1415–1422). New York, NY, USA: ACM.
Li, M., Yang, S., & Liu, X. (2014). Diversity comparison of Pareto front approximations https://doi.org/10.1145/2576768.2598348.
in many-objective optimization. IEEE Transactions on Cybernetics, 44(12), 2568– Slowinski, R., & Teghem, J. (1990). Stochastic versus fuzzy approaches to multiobjective
2584. https://doi.org/10.1109/TCYB.2014.2310651. mathematical programming under uncertainty. Springer. https://doi.org/10.1007/
Li, M., Yang, S., & Liu, X. (2015). A performance comparison indicator for Pareto 978- 94- 009- 2111-5.
front approximations in many-objective optimization. In Proceedings of the 2015 Tan, K., Lee, T., & Khor, E. (2002). Evolutionary algorithms for multi-objective op-
annual conference on genetic and evolutionary computation (pp. 703–710). New timization: performance assessments and comparisons. Artificial Intelligence Re-
York, NY, USA: ACM. https://doi.org/10.1145/2739480.2754687. view, 17(4), 251–290. https://doi.org/10.1023/A:1015516501242.
Li, M., & Yao, X. (2017). Dominance move: A measure of comparing solution sets in Tian, Y., Zhang, X., Cheng, R., & Jin, Y. (2016). A multi-objective evolutionary algo-
multiobjective optimization. Technical Report. arXiv. rithm based on an enhanced inverted generational distance metric. In Proceed-
421
C. Audet, J. Bigeon, D. Cartier et al. European Journal of Operational Research 292 (2021) 397–422
ings of the 2016 IEEE congress on evolutionary computation (CEC’2016) (pp. 5222– Wu, J., & Azarm, S. (20 0 0). Metrics for quality assessment of a multiobjective design
5229). IEEE. https://doi.org/10.1109/CEC.2016.7748352. optimization solution set. Journal of Mechanical Design, 123(1), 18–25. https://
Trautmann, H., Wagner, T., & Brockhoff, D. (2013). R2-EMOA: Focused multiobjective doi.org/10.1115/1.1329875.
search using R2-indicator-based selection. In G. Nicosia, & P. Pardalos (Eds.), Yevseyeva, I., Guerreiro, A., Emmerich, M., & Fonseca, C. (2014). A portfolio optimiza-
Learning and intelligent optimization (pp. 70–74). Berlin, Heidelberg: Springer. tion approach to selection in multiobjective evolutionary algorithms. In Parallel
https://doi.org/10.1007/978- 3- 642- 44973- 4_8. problem solving from nature – PPSN XIII (pp. 672–681). https://doi.org/10.1007/
Ulrich, T., Bader, J., & Thiele, L. (2010a). Defining and optimizing indicator-based 978- 3- 319- 10762- 2_66.
diversity measures in multiobjective search. In R. Schaefer, C. Cotta, J. Kołodziej, Zhao, J., Jiao, L., Liu, F., Fernandes, V., Yevseyeva, I., Xia, S., & Emmerich, M. (2018).
& G. Rudolph (Eds.), Parallel problem solving from nature, PPSN XI (pp. 707–717). 3D fast convex-hull-based evolutionary multiobjective optimization algorithm.
Berlin, Heidelberg: Springer. https://doi.org/10.1007/978- 3- 642- 15844- 5_71. Applied Soft Computing, 67, 322–336. https://doi.org/10.1016/j.asoc.2018.03.005.
Ulrich, T., Bader, J., & Zitzler, E. (2010b). Integrating decision space diversity into Zheng, K., Yang, R., Xu, H., & Hu, J. (2017). A new distribution metric for comparing
hypervolume-based multiobjective search. In Proceedings of the 12th annual con- Pareto optimal solutions. Structural and Multidisciplinary Optimization, 55(1), 53–
ference on genetic and evolutionary computation (pp. 455–462). New York, NY, 62. https://doi.org/10.10 07/s0 0158-016-1469-3.
USA: ACM. https://doi.org/10.1145/1830483.1830569. Zhou, A., Jin, A., Zhang, Q., Sendhoff, B., & Tsang, E. (2006). Combining model-based
Van Veldhuizen, D. (1999). Multiobjective evolutionary algorithms: Classifications, and genetics-based offspring generation for multi-objective optimization using
analyses, and new innovations. Technical Report. School of Engineering of the a convergence criterion. In IEEE international conference on evolutionary compu-
Air Force Institute of Technology, Dayton, Ohio. tation (pp. 892–899). IEEE. https://doi.org/10.1109/CEC.2006.1688406.
Van Veldhuizen, D., & Lamont, G. (20 0 0). On measuring multiobjective evolution- Zitzler, E. (1999). Evolutionary algorithms for multiobjective optimization: Methods and
ary algorithm performance. In Proceedings of the 2000 congress on evolutionary applications. Zurich, Swiss Federal: Institute of Technology Ph.D. thesis..
computation: 1 (pp. 204–211). https://doi.org/10.1109/CEC.20 0 0.870296. Zitzler, E., Brockhoff, D., & Thiele, L. (2007). The hypervolume indicator revis-
Vargas, A., & Bogoya, J. (2018). A generalization of the averaged Hausdorff ited: on the design of Pareto-compliant indicators via weighted integration.
distance. Computación y Sistemas, 22(2), 331–345. https://doi.org/10.13053/ In S. Obayashi, K. Deb, C. Poloni, T. Hiroyasu, & T. Murata (Eds.), Evolutionary
CyS- 22- 2- 2950. multi-criterion optimization (pp. 862–876). Berlin, Heidelberg: Springer. https:
Wagner, T., Trautmann, H., & Brockhoff, D. (2013). Preference articulation by means //doi.org/10.1007/978- 3- 540- 70928- 2_64.
of the R2 indicator. In R. Purshouse, J. Fleming, C. Fonseca, S. Greco, & J. Shaw Zitzler, E., Deb, K., & Thiele, L. (20 0 0). Comparison of multiobjective evolutionary
(Eds.), Evolutionary multi-criterion optimization (pp. 81–95). Berlin, Heidelberg: algorithms: Empirical results. Evolutionary Computation, 8(2), 173–195. https://
Springer. https://doi.org/10.1007/978- 3- 642- 37140- 0_10. doi.org/10.1162/106365600568202.
Wagner, T., Trautmann, H., & Martí, L. (2011). A taxonomy of online stopping criteria Zitzler, E., Knowles, J., & Thiele, L. (2008). Quality assessment of Pareto set approx-
for multi-objective evolutionary algorithms. In R. Takahashi, K. Deb, E. Wanner, imations. In Multiobjective optimization (pp. 373–404). Springer, Berlin, Heidel-
& S. Greco (Eds.), Evolutionary multi-criterion optimization (pp. 16–30). Berlin, berg. https://doi.org/10.1007/978- 3- 540- 88908- 3_14.
Heidelberg: Springer. https://doi.org/10.1007/978- 3- 642- 19893- 9_2. Zitzler, E., & Künzli, S. (2004). Indicator-based selection in multiobjective search.
Wagner, T., Trautmann, H., & Naujoks, B. (2009). OCD: Online convergence detec- In X. Yao, E. Burke, J. Lozano, J. Smith, J. Merelo-Guervós, J. Bullinaria, . . . H.-
tion for evolutionary multi-objective algorithms based on statistical testing. In S. Schwefel (Eds.), Parallel problem solving from nature - PPSN VIII (pp. 832–842).
M. Ehrgott, C. Fonseca, X. Gandibleux, J.-K. Hao, & M. Sevaux (Eds.), Evolu- Berlin, Heidelberg: Springer. https://doi.org/10.1007/978- 3- 540- 30217- 9_84.
tionary multi-criterion optimization (pp. 198–215). Berlin, Heidelberg: Springer. Zitzler, E., & Thiele, L. (1998). Multiobjective optimization using evolutionary al-
https://doi.org/10.1007/978- 3- 642- 01020- 0_19. gorithms – A comparative case study. In A. Eiben, T. Bäck, M. Schoenauer, &
While, L., Bradstreet, L., & Barone, L. (2012). A fast way of calculating exact hy- H. Schwefel (Eds.), Parallel problem solving from nature – PPSN V (pp. 292–301).
pervolumes. IEEE Transactions on Evolutionary Computation, 16(1), 86–95. https: Berlin, Heidelberg: Springer. https://doi.org/10.10 07/BFb0 056872.
//doi.org/10.1109/TEVC.2010.2077298. Zitzler, E., Thiele, L., Laumanns, M., Fonseca, C., & da Fonseca, V. (2003). Perfor-
Wiecek, M., Ehrgott, M., & Engau, A. (2016). Continuous multiobjective program- mance assessment of multiobjective optimizers: An analysis and review. IEEE
ming. In Multiple criteria decision analysis (pp. 739–815). Springer New York. Transactions on Evolutionary Computation, 7(2), 117–132. https://doi.org/10.1109/
https://doi.org/10.1007/978- 1- 4939- 3094- 4_18. tevc.2003.810758.
422