Arxiv_ v1 [Math.oc] 23 Sep 2021
Arxiv_ v1 [Math.oc] 23 Sep 2021
Arxiv_ v1 [Math.oc] 23 Sep 2021
a
College of Mathematics, Sichuan University, Chengdu, China; b School of Mathematical
Sciences, Chongqing Normal University, Chongqing, China; c College of Mathematics and
Statistics, Chongqing Jiaotong University, Chongqing, China
ARTICLE HISTORY
Compiled September 24, 2021
ABSTRACT
In this paper, we propose an extension of the classical Frank-Wolfe method for
solving constrained vector optimization problems with respect to a partial order
induced by a closed, convex and pointed cone with nonempty interior. In the pro-
posed method, the construction of auxiliary subproblem is based on the well-known
oriented distance function. Two types of stepsize strategies including Armijio line
search and adaptive stepsize are used. It is shown that every accumulation point
of the generated sequences satisfies the first-order necessary optimality condition.
Moreover, under suitable convexity assumptions for the objective function, it is
proved that all accumulation points of any generated sequences are weakly efficient
points. We finally apply the proposed algorithms to a portfolio optimization problem
under bicriteria considerations.
KEYWORDS
vector optimization; Frank-Wolfe method; stationary point; convergence; portfolio
optimization
1. Introduction
2
(see Algorithm 1) and adaptive stepsize (see Algorithm 3) for vector optimization
problems are introduced and the convergence results of the produced sequences are
obtained. An application to a bicriteria portfolio optimization problem is presented in
Section 5. Finally, in Section 6, we make some conclusions about our works.
2. Preliminaries
For a nonempty set X ⊂ Rm , the interior and boundary of X are respectively de-
noted by int(X) and bd(X). Let C ⊂ Rm be a closed, convex and pointed cone with
nonempty interior. For any y1 , y2 ∈ Rm , the partial order in Rm induced by C is
defined as
y1 y2 ⇔ y2 − y1 ∈ C,
y1 ≺ y2 ⇔ y2 − y1 ∈ int(C).
We now recall the concept of oriented distance function (also called assigned dis-
tance function or Hiriart-Urruty function), which was proposed by Hiriart-Urruty [28]
to investigate optimality conditions of nonsmooth optimization problems from the ge-
ometric point of view. The oriented distance function has been extensively used in
several works, such as scalarization for vector optimization [29,30], optimality condi-
tions for vector optimization [31], optimality conditions for set-valued optimization
problems [32], etc. Herein, we consider the oriented distance function in Rm .
Note that
y ∈ Rm \A,
dA (y), if
4A (y) =
−dRm \A (y), if y ∈ A.
3
y1 + y2 ≥ 0, y2 ≥ 0}. Let A := −C and
y2 , if y ∈ B1 ,
dA (y) := kyk2 , if y ∈ B2 ,
|y1√+y2 |
2
, if y ∈ B3
and
(
|y1 +y2 |
√ , if y ∈ B4 ,
dR2 \A (y) := 2
|y2 |, if y ∈ B5 .
In this paper, for our purposes, let A := −C in Definition 2.1. For the sake of
convenience, we let
According to [29, Proposition 3.2] and the fact that C is a closed, convex and pointed
cone with nonempty interior, we have immediately the following properties related to
ϕC .
Lemma 2.3. [29] Let ϕC (·) be defined in (2). Then the following statements hold:
(i) ϕC is real valued and 1-Lipschitzian;
(ii) ϕC (y) < 0 for any y ∈ −int(C), ϕC (y) = 0 for any y ∈ bd(−C), and ϕC (y) > 0
for any y ∈ int(Rm \(−C));
(iii) ϕC is convex;
(iv) ϕC is positively homogeneous;
(v) For all y1 , y2 ∈ Rm ,
4
Remark 1. If M is a compact set, then diam(M ) = maxx,y∈M kx − yk2 and it is a
finite number.
∂fi
(JF (x))i,j = (x),
∂xj
In this paper, we consider the following constrained vector optimization problems with
respect to the partial order C:
Definition 3.1. [34] A point x ∈ Ω is called weakly efficient solution of problem (3)
if there exists no x∗ ∈ Ω such that F (x∗ ) ≺ F (x).
A necessary, but not sufficient, first order optimality condition for problem (3) at
x̂ ∈ Ω, is
JF (x̂)(s − x̂) = (h∇f1 (x̂), s − x̂i, h∇f2 (x̂), s − x̂i, . . . , h∇fm (x̂), s − x̂i)> .
5
Obviously, (4) is equivalent to JF (x̂)(s − x̂) ∈
/ −int(C) for any s ∈ Ω.
Remark 4. Note that if x̂ ∈ Ω is not a stationary point of problem (3), then there
exists ŝ ∈ Ω such that JF (x)(ŝ − x̂) ∈ −int(C), i.e., ϕC (JF (x̂)(ŝ − x̂)) < 0 from
Lemma 2.3(ii). In this case, as analyzed in [16, pp. 665], we can assert that ŝ − x̂ is a
descent direction for F .
We conclude this section by giving the relation between stationary point and weakly
efficient solution. The proof of this property can be similarly analyzed from [13, pp.
410] and we omit the process here.
6
Notice that, it follows from Lemma 2.3(iii) that ψx defined in (5) is a convex function.
This, combined with the fact that Ω is a nonempty, compact and convex set, gives
that problem (6) admits an optimal solution (possibly not unique) on Ω. We denote
the optimal solution of problem (6) by s(x), i.e.,
According to Remark 4, we formally give the search direction for the objective
function F at x.
Definition 4.1. For any given point x ∈ Ω, the search direction of the Frank-Wolfe
method for F at x is defined as
Proof. (i) Since x ∈ Ω, it follows from (7) and (8) that v(x) = mins∈Ω ψx (s) ≤
ψx (x) = ϕC (JF (x)(x − x)) = ϕC (0). Besides, ϕC (0) = 0 by Lemma 2.3(ii). Thus,
v(x) ≤ 0.
(ii) Necessity. Suppose that x ∈ Ω is a stationary point of problem (3). Then, it
follows from Remark 2 that ϕC (JF (x)(s − x)) ≥ 0 for any s ∈ Ω. By (7), we have
s(x) ∈ Ω. Hence, v(x) = ϕC (JF (x)(s(x) − x)) ≥ 0. This, combined with (i), yields
that v(x) = 0.
Sufficiency. Let v(x) = 0. According to (8), we obtain 0 = v(x) ≤ ψx (s) =
ϕC (JF (x)(s − x)) for all s ∈ Ω, which implies that x is a stationary point of problem
(3).
Proof. Take x ∈ Ω and let {xk } be a sequence in Ω such that limk→∞ xk = x. In order
to obtain the continuity of v on Ω, it is sufficient to prove that limk→∞ v(xk ) = v(x),
i.e.,
7
Since s(x) ∈ Ω, using (7) and (8), we can obtain for all k,
Let us show that v(x) ≤ lim inf k→∞ v(xk ). Obviously, we have
where the last inequality follows from Lemma 2.3(v). Taking lim inf k→∞ in (13), we
get
where the penultimate inequality follows from Lemma 2.3(i). Since s(xk ), xk ∈ Ω, it
follows from Remark 1 that ks(xk ) − xk k ≤ diam(Ω) < ∞. This, combined with the
continuously differentiability of F and (14), we get v(x) ≤ lim inf k→∞ v(xk ).
Altogether, (10) holds. Consequently, v is continuous on Ω.
t := τ t,
8
where τ ∈ (0, 1). The following lemma demonstrates the finiteness of this procedure
in view of the fact that (15) holds strictly for t > 0 small enough.
Lemma 4.4. Let s(x) be defined in (7) and JF (x)(s(x) − x) ≺ 0. If β ∈ (0, 1), then
there exists some t̂ > 0 such that
Theorem 4.5. Let {xk } be a sequence produced by Algorithm 1. Then, xk ∈ Ω for all
k.
9
Proof. We proceed by induction. From Algorithm 1, we have x0 ∈ Ω for k = 0.
Assume that xk ∈ Ω for k > 0. We shall prove xk+1 ∈ Ω for k + 1. It is easy to see that
sk ∈ Ω from Algorithm 1. According to the convexity of Ω, we have xk+1 = xk +tk dk =
xk + tk (sk − xk ) = tk sk + (1 − tk )xk ∈ Ω for tk ∈ (0, 1].
We present some properties related to the points which are iterated by Algorithm
1.
Proof. (i) From the assumption that an infinite sequence {xk } is generated by Algo-
rithm 1 and Remark 6, we have v k < 0 for all k.
(ii) By Theorem 4.5, xk+1 ∈ Ω for all k. From (15), we have
From the nonstationarity of xk and Remark 4, we have JF (xk )(sk −xk ) ≺ 0. Therefore,
the above inequality implies that F (xk+1 ) ≺ F (xk ).
(iii) For any i, we have
where the first inequality holds in view of (15) and Lemma 2.3(vi), the second inequal-
ity follows from Lemma 2.3(v), the first equality is due to Lemma 2.3(iv). According
to (16) and (i), we have
ti |v i | = −ti v i
ϕC (F (xi )) − ϕC (F (xi+1 )) (17)
≤ .
β
Theorem 4.7. Let {xk } be a sequence produced by Algorithm 1. Then, every accu-
mulation point of {xk } is a stationary point of problem (3).
Proof. Let x̂ ∈ Ω be a accumulation point of the sequence {xk }. Then, there exists a
subsequence {xkj } of {xk } such that
From Proposition 4.3 and (18), we have v(xkj ) → v(x̂) whenever j → ∞. Here, it is
sufficient to show that v(x̂) = 0 in view of Proposition 4.2(ii).
10
Let k := kj in Proposition 4.6(iii). Then
kj
X ϕC (F (x0 )) − ϕC (F (xkj +1 ))
ti |v(xi )| ≤ .
β
i=0
P∞ i
Taking limj→∞ on both sides of the above inequality, we get i=0 ti |v(x )| < ∞, which
implies that limk→∞ tk v(xk ) = 0, and in particular,
(a) lim sup tkj > 0 or (b) lim sup tkj = 0. (20)
j→∞ j→∞
We first suppose that (20)(a) holds. Then, there exists a subsequence {tkji } of {tkj }
converging to some t̂ > 0. And from (18), we have limi→∞ xkji = x̂. Thus, (19) implies
that limi→∞ tkji v(xkji ) = 0, and furthermore, limi→∞ v(xkji ) = 0. This, combined
with Proposition 4.3, gives that 0 = limi→∞ v(xkji ) = v(x̂).
We now consider (20)(b). Clearly, xkj , s(xkj ) ∈ Ω. From the compactness of Ω,
Remark 1 and (9), we have
i.e., the sequence {d(xkj )} is bounded. Now, we take subseqences {xkji }, {d(xkji )} and
{tkji } converging to x̂, d(x̂) and 0, respectively. By (8) and Proposition 4.6(i), we get
Taking limi→∞ on both sides of the above inequality and togethering with Proposition
4.3, we have
v(x̂) ≤ 0. (21)
Take some fixed but arbitrary l ∈ N, where N denotes the set of natural numbers. From
limi→∞ tkji = 0, we have tkji < τ l for i large enough. This shows that the Armijio
condition is not satisfied at xkji for t = τ l , that is,
or, equivalently,
11
where the equality holds in view of the closedness of C. By (22) and Lemma 2.3(ii),
we have
According to (24), Lemma 4.4 and Lemma 2.3(ii), we can obtain JF (x̂)d(x̂) ⊀ 0, i.e.,
JF (x̂)d(x̂) ∈ Rm \(−int(C)). Thus, we have v(x̂) = ϕC (JF (x̂)d(x̂)) ≥ 0 from Lemma
2.3(ii). This, combined with (21), yields that v(x̂) = 0.
It follows from Theorems 3.3 and 4.7 that the following result holds.
L
F (y) − F (x) JF (x)(y − x) + ky − xk22 e. (25)
2
L 2
Proof. Let G(·) := 2 k·k2 e−F (·). Since G(·) is C-convex on Ω, it follows from Lemma
2.6 that
12
From the notion of G(·), (26) and (27), we have
L L
F (y) − F (x) JF (x)(y − x) + kyk22 e − kxk22 e − Lhx, yie + Lkxk22 e
2 2
L
= JF (x)(y − x) + (kyk22 + kxk22 − 2hx, yi)e
2
L
= JF (x)(y − x) + ky − xk22 e,
2
and the proof is complete.
Remark 7. Here we call the property the vector version of the classical descent lemma
with respect to the order cone C. Actually, the setting of the condition in Lemma 4.9
is inspired by the works of Bauschke et al. [36].
L
ϕC (F (y)) − ϕC (F (x)) ψx (y) + ky − xk22 ϕC (e). (28)
2
Therefore, it immediately follows from (29), (30) and (5) that (28) holds.
Let us now give the Frank-Wolfe algorithm with adaptive stepsize (see Algorithm
3) for solving problem (3).
xk+1 ← xk + tk dk
end for
13
Likewise, Algorithm 3 can terminate with a stationary point in a finite number
of iterations or generate an infinite sequence. We will suppose that in the sequel
Algorithm 3 produces an infinite sequence {xk } of nonstationary points. Clearly, it
follows from Remark 6 that v k < 0 for all k.
Lemma 4.10. Suppose that L2 k · k22 e − F (·) is C-convex on Ω, ϕC (e) < 2 and {xk } is
a sequence produced by Algorithm 3. Then, for all k, it holds that
(v k )2
k+1 k ϕ (e) − 2 k
ϕC (F (x )) − ϕC (F (x )) ≤ C min , −v . (31)
2 L(diam(Ω))2
vk
tk = min 1, − . (32)
Lkdk k22
Since L2 k·k22 e−F (·) is C-convex on Ω, then by (28) invoked with x = xk and y = xk+1 ,
we have
L 2 k 2
ϕC (F (xk+1 )) − ϕC (F (xk )) ≤ ψxk (xk + tk (sk − xk )) + t kd k2 ϕC (e)
2 k
L 2 k 2
= tk ψxk (sk ) + t kd k2 ϕC (e) (33)
2 k
L 2 k 2
= tk v k + t kd k2 ϕC (e),
2 k
where the first equality holds in view of (5). According to (32), there are two options:
Case 1. Let tk = 1. This, combined with (32), gives that
2 − ϕC (e) k
ϕC (F (xk+1 )) − ϕC (F (xk )) ≤ v . (35)
2
k
Case 2. Let tk = − Lkdv k k2 . From Remark 1, we get kdk k = ksk − xk k ≤ diam(Ω).
2
This, together with (33), ϕC (e) < 2, yields that
ϕC (e) − 2 (v k )2
ϕC (F (xk+1 )) − ϕC (F (xk )) ≤
2 Lkdk k22
(36)
ϕ (e) − 2 (v k )2
≤ C .
2 L(diam(Ω))2
To present our convergence analysis for Algorithm 3, we need the following assump-
tion.
14
Assumption A. The sequence {F (xk )} is C-bounded from below, i.e., there exists
F̄ ∈ Rm such that F̄ F (xk ) for all k.
Theorem 4.11. Suppose that L2 k · k22 e − F (·) is C-convex on Ω, ϕC (e) < 2 and {xk } is
a sequence produced by Algorithm 3. If Assumption A holds, then every accumulation
point of {xk } is a stationary point of problem (3).
Proof. From (31), ϕC (e) < 2 and v k < 0, we have for all k,
(v k )2
k+1 k ϕ (e) − 2
ϕC (F (x )) − ϕC (F (x )) ≤ C min , −v k
2 L(diam(Ω))2 (37)
< 0,
i.e., ϕC (F (xk+1 )) < ϕC (F (xk )), which implies that {ϕC (F (xk ))} is nonincreasing for
all k. Since {F (xk )} is C-bounded from below (say by F̄ ), i.e., F̄ F (xk ) for all k,
it follows from Lemma 2.3(vi) that ϕC (F̄ ) ≤ ϕC (F (xk )) for all k. Therefore, we know
that the sequence {ϕC (F (xk ))} is convergent. This obviously means that
From Proposition 4.3, Proposition 4.2(ii) and (39), we obtain that each accumulation
point of {xk } is a stationary point of problem (3).
Remark 11. It is noteworthy that the extended Frank-Wolfe methods for vector op-
timization problems presented in Algorithms 1 and 3 are conceptual and theoretical
schemes rather than implementable algorithms. Similar issues also appear in the liter-
ature; see, e.g., [12,13,15,17–20,22,24–27]). Therefore, the computational efficiency of
the method to a real-world optimization problem depends essentially on the choice of
a good feature and structure of the minimization subproblem (6) at every iteration.
For example, when the norm k · k∞ is used and C := Rm + , the objective function ψx
in (6) has the simple form as shown in Remark 5, and then the descent direction can
be easily computed by program. We consider this issue as a subject in the following
section.
15
5. Numerical experiments for portfolio optimization
−x> u
minR2+ F (x) =
x> V x (40)
s.t. x ∈ Ω
The problem (40) is actually a well-known portfolio optimization problem, which plays
a critical role in determining portfolio strategies for investors. The decision variable
x = (x1 , x2 , . . . , xn ) of problem (40) stands for the asset weight vector, where xi ,
i = 1, 2, . . . , n, is the weight of asset i in the portfolio. u means the return rate
of the asset and the variance-covariance matrix V = (σij )n×n denotes the variance
and covariance of individual asset, where σii is the variance of asset i and σij is the
covariance between asset i and asset j. The first objective function denotes the negative
of the expected return (that is to be maximized, therefore minimized with a leading
minus) and the second one is to minimize the variance of the portfolio, which quantifies
the risk associated to the considered portfolio.
Herein, we use the real data presented in [37] that contains five stocks: IBM, Mi-
crosoft, Apple, Quest Diagnostics and Bank of America. The expected return and
variance of each stock in the portfolio were calculated based on historical stock price
and dividend payment from February 1, 2002 to February 1, 2007. Thus, in problem
(40), n = 5,
16
Clearly, problem (41) is nondifferentiable. Correspondingly, it can be equivalently
transfomed into the following differentiable form
min γ
s.t. γ ≥ h∇fi (x), s − xi, i = 1, 2 (42)
s∈Ω
Observe that problem (42) is a linear convex optimization problem. Therefore, the
optimal solution of problem (42) in our experiment can be obtained by using the
linprog of the solver optimize in Python. Moreover, the constrained set is actually an
unit simplex. So in order to obtain a set of weakly efficient solutions, we randomly and
uniformly sample 50 initial points on the simplex. The stopping criteria in Algorithms
1 and 3 are set as |v k | ≤ := 10−5 . Algorithms 1 and 3 were respectively run 50 times
by using same initial points and each time they ended at solution points, which have
been obtained after the verification of the stopping criterion. The solutions obtained
by Algorithms 1 and 3 are displayed in Figure 1. Meanwhile, the number of iterations
(on the “y” axes) and computing CPU time in seconds (on the “y” axes) for each initial
point (50 in total on the “x” axes) are reported in Figure 2(a) and (b), respectively.
Note that, in Figure 2(a)–(b), the red and blue dotted lines denote respectively the
average of iterations and CPU time obtained by Algorithms 1 and 3 for 50 instances
(the specific values are presented in Table 1).
Figure 1 shows that some possible optimal portfolio points on a return-risk tradeoff.
17
(a) Iterations (b) CPU time
As we have seen, the expected return is increasing with the risk. From Figure 2 and
Table 1, we observe that Algorithm 1 with Armijio line search takes fewer iterations
and CPU time than Algorithm 3 with adaptive stepsize for the same initial points.
A reasonable explanation of this phenomenon from the experimental data is that the
change of stepsize t in Algorithm 3 for each iteration is very small, which leads to a
small improvement of the objective function F , so it comes with additional cost.
6. Conclusions
In this paper, we have extended the classical Frank-Wolfe method to solve constrained
vector optimization problems with respect to a closed, convex and pointed cone with
nonempty interior. A key point is that we construct a auxiliary subproblem via the
well-known oriented distance function. Under reasonable assumptions, we prove that
accumulation points of the sequences generated by the proposed algorithms with two
different strategies of stepsizes are stationary. Applications to portfolio optimization
under bicriteria considerations are given.
In recent years, the convergence rate analysis of some gradient-based methods for
vector optimization problems have established under the setting of the partial order in
Rm is the nonnegative orthant (see [5,11,38,39]). Moreover, there are some convergence
rate results in the case of the general cone order (see [16,19]). It is noteworthy that
in this paper we have not analyzed the convergence rate of the proposed methods. An
interesting topic for future research is to investigate this issue.
References
[1] John J. Vector optimization: theory, applications and extensions. 2nd ed. Berlin: Springer;
2011.
[2] Miettinen K. Nonlinear multiobjective optimization. Boston, MAA: Kluwer; 1999.
[3] Eichfelder G. Adaptive scalarization methods in multiobjective optimization. Berlin:
Springer; 2008.
18
[4] Fliege J, Svaiter BF. Steepest descent methods for multicriteria optimization. Math.
Methods Oper Res. 2000;51: 479–494.
[5] Fliege J, Graña Drummond LM, Svaiter BF. Newton’s method for multiobjective opti-
mization. SIAM J Optim., 2009;20(2): 602–626.
[6] Qu SJ, Goh M, Chan FTS. Quasi-Newton methods for solving multiobjective optimiza-
tion. Oper Res Lett. 2011;39(5): 397–399.
[7] Qu SJ, Goh M, Liang B. Trust region methods for solving multiobjective optimisation.
Optim Methods Softw. 2013;28(4): 796–811.
[8] Fukuda EH, Graña Drummond L.M.: A survey on multiobjective descent methods.
Pesquisa Oper. 2014;34(3): 585–620.
[9] Tanabe H, Fukuda EH, Yamashita N. Proximal gradient methods for multiobjective op-
timization and their applications. Comput Optim Appl. 2019;72(2): 339–361.
[10] Wang JH, Hu YH, Wai Yu CK, Li C, Yang XQ. Extended Newton methods for multi-
objective optimization: majorizing function technique and convergence analysis. SIAM J
Optim. 2019;29(3): 2388–2421.
[11] Assunção PB, Ferreira OP, Prudente LF. Conditional gradient method for multiobjective
optimization. Comput Optim Appl. 2021;78(3): 741–768.
[12] Graña Drummond LM, Iusem AN. A projected gradient method for vector optimization
problems. Comput Optim Appl. 2004;28(1): 5–29.
[13] Graña Drummond LM, Svaiter BF. A steepest descent method for vector optimization
problems. J Comput Appl Math. 2005;175: 395–414.
[14] Fukuda EH, Graña Drummond LM. On the convergence of the projected gradient method
for vector optimization. Optimization. 2011;60(8–9): 1009–1021.
[15] Fukuda EH, Graña Drummond LM. Inexact projected gradient method for vector opti-
mization. Comput Optim Appl. 2013;54(3): 473–493.
[16] Graña Drummond LM, Raupp FMP, Svaiter BF. A quadratically convergent Newton
method for vector optimization. Optimization. 2014;63(5): 661–677.
[17] Lu F, Chen CR. Newton-like methods for solving vector optimization problems. Appl
Anal. 2014;93(8): 1567–1586.
[18] Qu SJ, Goh M, Ji Y, Souza RD. A new algorithm for linearly constrained c-convex vector
optimization with a supply chain network risk application. Eur J Oper Res. 2015;247(2):
359–365.
[19] Qu SJ, Ji Y, Jiang JL, Zhang QP. Nonmonotone gradient methods for vector optimization
with a portfolio optimization application. Eur J Oper Res. 2017;263(2): 356–366.
[20] Gonçalves MLN, Prudente LF. On the extension of the Hager–Zhang conjugate gradient
method for vector optimization. Comput Optim Appl. 2020;76(3): 889–916.
[21] Bonnel H, Iusem AN, Svaiter BF. Proximal methods in vector optimization. SIAM J
Optim. 2005;15(4): 953–970.
[22] Ceng LC, Yao JC. Approximate proximal methods in vector optimization. Eur. J Oper
Res. 2007;183(1): 1–19.
[23] Chen Z, Huang HQ, Zhao KQ. Approximate generalized proximal-type method for convex
vector optimization problem in Banach spaces. Comput Math Appl. 2009;57(7): 1196–
1203.
[24] Chuong TD. Tikhonov-type regularization method for efficient solutions in vector opti-
mization. J Comput Appl Math. 2010;234(3): 761–766.
[25] Chuong TD, Yao JC. Steepest descent methods for critical points in vector optimization
problems. Appl Anal. 2012;91(10): 1811–1829.
[26] Chuong TD. Newton-like for efficient solutions in vector optimization. Comput Optim
Appl. 2013;54: 495–516.
[27] Boţ RI, Grad S-M. Inertial forward backward methods for solving vector optimization
problems. Optimization. 2018;67(7): 959–974.
[28] Hiriart-Urruty JB. Tangent cone, generalized gradients and mathematical programming
in Banach spaces. Mathe Oper Res. 1979;4: 79–97.
[29] Zaffaroni A. Degrees of efficiency and degrees of minimality. SIAM J Control. Optim.
19
2003;42: 1071–1086.
[30] Miglierina E, Molho E, Rocca M. Well-posedness and scalarization in vector optimization.
J Optim Theory Appl. 2005;126(2): 391–409.
[31] Gao Y, Hou SH, Yang XM. Existence and optimality conditions for approximate solutions
to vector optimization problems. J Optim Theory Appl. 2012;152(1): 97–120.
[32] Zhou ZA, Chen W, Yang XM. Scalarizations and optimality of constrained set-valued
optimization using improvement sets and image space analysis. J Optim Theory Appl.
2019;183: 944–962.
[33] Muscat J. Functional analysis: an introduction to metric spaces, Hilbert Spaces, and
Banach Algebras. Springer; 2014.
[34] Luc TD. Theory of vector optimization, lecture notes in economics and mathematical
systems, vol. 319, Berlin: Springer; 1989.
[35] Bech A. First-order method in optimization. MPS-SIAM Series on Optimization; SIAM,
Philadelphia; 2017.
[36] Bauschke HH, Bolte J, Teboulle M. A descent lemma beyond Lipschitz gradient continu-
ity: first-order methods revisited and applications. Math. Oper. Res., 2017;42(2): 330–348.
[37] Duan YC. A multi-objective approach to portfolio optimization. Undergraduate Math J.
2007;8(1): 1–18.
[38] Fliege J, Vaz AIF, Vicente LN. Complexity of gradient descent for multiobjective opti-
mization. Optim Methods Soft. 2019;34(5): 949–959.
[39] Tanabe H, Fukuda EH, Yamashita N. Convergence rates analysis of multiobjective prox-
imal gradient methods. arXiv preprint arXiv:2010.08217, 2020.
20