

TMA947 / MMG621 — Nonlinear optimization

Lecture 4 — Introduction to optimality conditions


Emil Gustavsson, Zuzana Nedělková

October 20, 2017

[Minor revision: Axel Ringh - September, 2024]

Local and global optimality

We consider the optimization problem

minimize f (x), (1a)


subject to x ∈ S, (1b)

where S ⊆ Rn is a nonempty set, and f : Rn → R ∪ {+∞} is a given function.

[Figure: a function f of one variable over a set S ⊂ R, illustrating where candidate optimal points can occur.]

When n = 1, we know that an optimal solution will be found at

– boundary points of S,

– stationary points, that is, where f ′ (x) = 0,


– discontinuities of f or f ′ .
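
As a small numerical sketch (the cubic objective and the interval S = [0.5, 5] below are made up for illustration and are not from the notes), a smooth one-dimensional problem can be solved by comparing f at all candidate points:

    import numpy as np

    # Assumed example: minimize f(x) = x^3 - 6x^2 + 9x + 1 over S = [0.5, 5].
    # f is smooth on S, so the only candidates are the boundary points of S
    # and the stationary points of f.
    def f(x):
        return x**3 - 6.0 * x**2 + 9.0 * x + 1.0

    stationary = np.roots([3.0, -12.0, 9.0])        # roots of f'(x) = 3x^2 - 18x... here: 3x^2 - 12x + 9
    candidates = [0.5, 5.0] + [float(r) for r in stationary if 0.5 <= r <= 5.0]
    best = min(candidates, key=f)
    print(best, f(best))                            # the minimum is attained at the stationary point x = 3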

Definition (global minimum). x∗ ∈ S is a global minimum of f over S if

f (x∗ ) ≤ f (x), ∀ x ∈ S.

The next important definition is the one of a local minimum.


Definition (local minimum). x∗ ∈ S is a local minimum of f over S if

∃ε > 0 such that f (x∗ ) ≤ f (x), x ∈ S ∩ Bε (x∗ ),

where Bε (x∗ ) := { y ∈ Rn | ∥y − x∗ ∥ < ε } is the Euclidean ball with radius ε centered at x∗ .

We call x∗ ∈ S a strict local minimum of f over S if f (x∗ ) < f (x) holds above for all x ̸= x∗ .

One of the most important theorems in optimization follows.


Theorem (Fundamental Theorem of global optimality). Consider the problem (1), where S is a convex set and f is convex on S. Then every local minimum of f over S is also a global minimum.

Proof. See Theorem 4.3 in the book.

Existence of optimal solutions

First, some basic notation.

– We say that a set S ⊆ Rn is open if for every x ∈ S there exists an ε > 0 such that Bε (x) :=
{ y ∈ Rn | ∥y − x∥ < ε } ⊂ S.
– We say that a set S ⊆ Rn is closed if Rn \ S is open.
– A limit point of a set S ⊆ Rn is a point x such that there exists a sequence {xk } ⊂ S satisfying xk → x as k → ∞.
– We can then define a closed set as a set which contains all its limit points.
– We say that a set S ⊆ Rn is bounded if there exists a constant C > 0 such that ∥x∥ ≤ C for all
x ∈ S.
– If a set is both closed and bounded, we call it compact.

Two important definitions needed to formulate Weierstrass’ Theorem are the following.
Definition (weakly coercive function). A function f is said to be weakly coercive with respect to the
set S if either S is bounded or
lim f (x) = ∞ as ∥x∥ → ∞ with x ∈ S.

Definition (lower semi-continuity). A function f is said to be lower semi-continuous at x if the value f (x) is less than or equal to every limit of f as xk → x.

In other words, f is lower semi-continuous at x ∈ S if

xk → x =⇒ f (x) ≤ lim inf_{k→∞} f (xk ).


Figure 1: A lower semi-continuous function in one variable

Now we can formulate Weierstrass’ Theorem which guarantees the existence of optimal solutions
to an optimization problem as long as a few assumptions are satisfied.

Theorem (Weierstrass’ Theorem). Consider the problem (1), where S is a nonempty and closed set and
f is lower semi-continuous on S. If f is weakly coercive with respect to S, then there exists a nonempty,
closed and bounded (thus compact) set of optimal solutions to the problem (1).

Proof. See Theorem 4.6 in the book.

One way to remember the assumptions in Weierstrass’ Theorem is to imagine what can go wrong, i.e., when a problem does not have an optimal solution. One example of an optimization problem where the solution set is empty is f (x) = 1/x with S = [1, ∞): here S is closed but unbounded and f (x) → 0 as x → ∞, so f is not weakly coercive with respect to S, and the infimum 0 is never attained.

Optimality conditions when S = Rn

When S = Rn , i.e., the problem is an unconstrained optimization problem, then the following
theorem holds.
Theorem (necessary condition for optimality, C 1 ). If f ∈ C 1 on Rn , then

x∗ is a local minimum of f on Rn =⇒ ∇f (x∗ ) = 0

Proof. See Theorem 4.13.

Note that ∇f (x) = (∂f (x)/∂x1 , . . . , ∂f (x)/∂xn )T . The converse of the theorem is, however, not true: take f (x) = x³ and x∗ = 0, for which f ′ (0) = 0 although x∗ is not a local minimum. We can strengthen the theorem by assuming that f is also in C 2 .

Theorem (necessary condition for optimality, C 2 ). If f ∈ C 2 on Rn , then

x∗ is a local minimum of f on Rn =⇒ ∇f (x∗ ) = 0 and ∇2 f (x∗ ) ⪰ 0

Proof. See Theorem 4.16.


Remember that for a matrix A ∈ Rn×n , the notation A ⪰ 0 (A positive semidefinite) means that xT Ax ≥ 0 for all x ∈ Rn . Once again, the converse direction of the theorem is not true. However, by assuming positive definiteness of the Hessian of f , we can obtain a sufficient condition.
Theorem (sufficient condition for optimality, C 2 ). If f ∈ C 2 on Rn , then
∇f (x∗ ) = 0 and ∇2 f (x∗ ) ≻ 0 =⇒ x∗ is a strict local minimum of f on Rn

Proof. See Theorem 4.17.
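
A minimal numerical sketch of these conditions (the quadratic below is an assumed example, not from the notes): at a candidate point we check that the gradient vanishes and that all eigenvalues of the Hessian are positive.

    import numpy as np

    # Assumed example: f(x) = (x1 - 1)^2 + 2*(x2 + 3)^2, candidate point x* = (1, -3)
    def grad(x):
        return np.array([2.0 * (x[0] - 1.0), 4.0 * (x[1] + 3.0)])

    def hess(x):
        return np.array([[2.0, 0.0], [0.0, 4.0]])

    x_star = np.array([1.0, -3.0])

    print(np.allclose(grad(x_star), 0.0))                   # True: gradient is zero
    print(np.all(np.linalg.eigvalsh(hess(x_star)) > 0.0))   # True: Hessian positive definite
    # Both hold, so the sufficient condition gives: x* is a strict local minimum.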

To obtain a condition that is both necessary and sufficient for global optimality, we need to assume convexity of the function f .
Theorem (necessary and sufficient condition for optimality, C 1 ). If f ∈ C 1 is convex on Rn , then
x∗ is a global minimum of f on Rn ⇐⇒ ∇f (x∗ ) = 0

Proof. See Theorem 4.18.
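
For example (an assumed convex quadratic, not taken from the notes), f (x) = ½ xT Qx − bT x with Q positive definite is convex, so solving ∇f (x) = Qx − b = 0 immediately gives the global minimum:

    import numpy as np

    Q = np.array([[4.0, 1.0],
                  [1.0, 3.0]])          # positive definite
    b = np.array([1.0, 2.0])

    def f(x):
        return 0.5 * x @ Q @ x - b @ x

    x_star = np.linalg.solve(Q, b)      # stationary point: Q x* = b

    # No sampled point should beat f(x*)
    rng = np.random.default_rng(0)
    samples = rng.normal(scale=10.0, size=(1000, 2))
    print(all(f(x_star) <= f(x) + 1e-9 for x in samples))   # True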

Optimality conditions for S ⊆ Rn

When S = Rn , the set of directions in which we could move from a point x and still stay feasible was all of Rn . When we consider cases where S ⊂ Rn , this need no longer hold.
Definition (feasible direction). Let x ∈ S. A vector p ∈ Rn defines a feasible direction at x if
∃δ > 0 : x + αp ∈ S, for all α ∈ [0, δ].

So the feasible directions at a point x ∈ S describe the directions in which we can ”move” without becoming infeasible.
Definition (descent direction). Let x ∈ Rn . A vector p defines a descent direction with respect to f
at x if
∃δ > 0 : f (x + αp) < f (x), for all α ∈ (0, δ].

Suppose that f ∈ C 1 around a point x ∈ Rn , and that p ∈ Rn . If ∇f (x)T p < 0 then the vector
p defines a direction of descent with respect to f at x. We can now state necessary optimality
conditions for cases when S ̸= Rn .
Theorem (necessary optimality conditions). Suppose that S ⊆ Rn and that f ∈ C 1 on S.

a) If x∗ ∈ S is a local minimum of f over S, then


∇f (x∗ )T p ≥ 0
holds for all feasible directions p at x∗ .

b) Suppose that S is convex. If x∗ is a local minimum of f over S, then


∇f (x∗ )T (x − x∗ ) ≥ 0, x ∈ S (2)


Proof. See Proposition 4.22 in the book.

We refer to (2) as a variational inequality, and we can now extend the notion of stationary points by defining them as points fulfilling (2). This is the first of four equivalent definitions of a stationary point.
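
A hedged numerical sketch of (2) (the objective and the box-shaped feasible set below are assumed for illustration, not from the lecture): at a candidate point we test the variational inequality on randomly sampled feasible points.

    import numpy as np

    # Assumed example: minimize f(x) = (x1 - 2)^2 + (x2 - 0.5)^2 over the box S = [0, 1]^2.
    def grad(x):
        return np.array([2.0 * (x[0] - 2.0), 2.0 * (x[1] - 0.5)])

    x_star = np.array([1.0, 0.5])        # candidate minimizer, on the boundary of S

    rng = np.random.default_rng(0)
    xs = rng.uniform(0.0, 1.0, size=(1000, 2))     # random feasible points
    g = grad(x_star)
    print(np.all((xs - x_star) @ g >= -1e-12))     # True: (2) holds at x*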

Now the necessary and sufficient conditions for optimality can be stated in the following theorem.

Theorem (necessary and sufficient optimality conditions). Suppose S ⊆ Rn is a nonempty convex set and that f ∈ C 1 is a convex function on S. Then

x∗ is a global minimum of f over S ⇐⇒ ∇f (x∗ )T (x − x∗ ) ≥ 0, x ∈ S.

Proof. See Theorem 4.23 in the book.

Note that when S = Rn , the expression to the right just becomes ∇f (x∗ ) = 0. Why?

We will now present three additional definitions of a stationary point which are all equivalent to
(2). The first one we get by taking the minimum of the left-hand-side of (2) and then realizing that
the optimal value must be zero, i.e.,

min_{x∈S} ∇f (x∗ )T (x − x∗ ) = 0. (3)

Convince yourself that (2) and (3) are equivalent! Now we claim that (2) and (3) are also equivalent to
x∗ = ProjS [x∗ − ∇f (x∗ )] . (4)
The equation (4) states that if you stand in a stationary point and take a step in the direction of the
negative gradient and then project back to the feasible set, you should end up in the same point.
The details for showing this equivalence can be found in the book (pp. 94–95). See also Figure 2.
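
A small sketch of (4), reusing the assumed box example from above (for a box, ProjS is just componentwise clipping):

    import numpy as np

    # Assumed example: f(x) = (x1 - 2)^2 + (x2 - 0.5)^2, S = [0, 1]^2.
    def grad(x):
        return np.array([2.0 * (x[0] - 2.0), 2.0 * (x[1] - 0.5)])

    def proj_S(x):
        return np.clip(x, 0.0, 1.0)     # projection onto a box = componentwise clipping

    x_star = np.array([1.0, 0.5])
    print(np.allclose(x_star, proj_S(x_star - grad(x_star))))   # True: x* is stationary

    x = np.array([0.2, 0.9])            # some other feasible point
    print(np.allclose(x, proj_S(x - grad(x))))                  # False: not stationary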

For the last equivalent definition of a stationary point, we need to introduce the normal cone.

Definition (normal cone). Suppose the set S is closed and convex. Let x ∈ S. Then the normal cone to
S at x is the set
NS (x) := { p ∈ Rn | pT (y − x) ≤ 0, y ∈ S }.


Think of the normal cone at a point x as the set of all directions pointing ”straight out” from the set; see Figure 2. Now the fourth equivalent definition of a stationary point is that

−∇f (x∗ ) ∈ NS (x∗ ). (5)

That (5) is equivalent to (2) follows directly from the definition of NS (x∗ ): the inclusion −∇f (x∗ ) ∈ NS (x∗ ) means precisely that ∇f (x∗ )T (x − x∗ ) ≥ 0 for all x ∈ S.
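
For the assumed box example used above, the inclusion (5) can also be checked directly, using the (standard, but here added for illustration) componentwise description of the normal cone of a box: p_i ≤ 0 where x_i sits at the lower bound, p_i ≥ 0 where x_i sits at the upper bound, and p_i = 0 where x_i is strictly between the bounds.

    import numpy as np

    # Assumed box example again: S = [0, 1]^2, f(x) = (x1 - 2)^2 + (x2 - 0.5)^2.
    def grad(x):
        return np.array([2.0 * (x[0] - 2.0), 2.0 * (x[1] - 0.5)])

    def in_normal_cone_box(p, x, lo=0.0, hi=1.0, tol=1e-12):
        ok = True
        for pi, xi in zip(p, x):
            if abs(xi - lo) <= tol:
                ok &= pi <= tol          # at the lower bound: p_i <= 0
            elif abs(xi - hi) <= tol:
                ok &= pi >= -tol         # at the upper bound: p_i >= 0
            else:
                ok &= abs(pi) <= tol     # in the interior: p_i = 0
        return bool(ok)

    x_star = np.array([1.0, 0.5])
    print(in_normal_cone_box(-grad(x_star), x_star))   # True: condition (5) holds at x*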


Figure 2: Illustration of the normal cone and the projection operator.

Summary of optimality conditions for convex S ⊆ Rn

Definition (stationary point). Suppose that S is convex and that f ∈ C 1 . A point x∗ ∈ S fulfilling the four equivalent statements a)–d) below is called a stationary point.

a)
∇f (x∗ )T (x − x∗ ) ≥ 0, x ∈ S,

b)
min_{x∈S} ∇f (x∗ )T (x − x∗ ) = 0,

c)
x∗ = ProjS [x∗ − ∇f (x∗ )] ,

d)
−∇f (x∗ ) ∈ NS (x∗ ).

The two important theorems which will be utilized throughout the whole course are the following.

Theorem (necessary optimality conditions). Suppose that S is convex and that f ∈ C 1 . Then

x∗ is a local minimum of f over S =⇒ x∗ is stationary

Theorem (necessary and sufficient optimality conditions). Suppose that S is convex and that f ∈ C 1
is convex. Then
x∗ is a global minimum of f over S ⇐⇒ x∗ is stationary

As we will see later in the course, the last definition (the inclusion (5)) is the only one that can be
extended to the case of non-convex sets S.


The separation theorem

Now we will present a very useful theorem for convex sets, which says: "If a point y does not lie in a closed convex set S, then there exists a hyperplane separating y from S." Mathematically, this amounts to the following.

Theorem (the separation theorem). Suppose that S ⊆ Rn is closed and convex, and that the point y
does not lie in S. Then there exists a vector π ̸= 0 and a scalar α ∈ R such that π T y > α, and π T x ≤ α
for all x ∈ S.

Proof. See Theorem 4.29.
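
A small numerical sketch (the set S below, the closed Euclidean unit ball, is an assumed example, not from the notes): one standard way to produce the separating hyperplane is to take π = y − ProjS (y) and α = π T ProjS (y).

    import numpy as np

    def proj_S(y):
        # Projection onto the closed Euclidean unit ball
        n = np.linalg.norm(y)
        return y if n <= 1.0 else y / n

    y = np.array([2.0, 1.0])            # a point outside S
    x_bar = proj_S(y)
    pi = y - x_bar                      # hyperplane normal
    alpha = pi @ x_bar

    print(pi @ y > alpha)               # True: pi^T y > alpha

    # pi^T x <= alpha should hold for every x in S; check on random points of the ball
    rng = np.random.default_rng(0)
    pts = rng.normal(size=(1000, 2))
    pts /= np.maximum(1.0, np.linalg.norm(pts, axis=1, keepdims=True))
    print(np.all(pts @ pi <= alpha + 1e-12))   # True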

Figure 3: Illustration of the separation theorem.

The separation theorem can be used to prove Farkas’ Lemma efficiently, see Theorem 4.33.
