MAE Optimization Lecture 3 Handout
Unconstrained Optimization
E. Flayac
Numerical Optimization
Introduction to Optimization | Optimality conditions for unconstrained optimization | March 11th | Slide 3/43
Unconstrained Optimization Problem
Definitions
▶ We say that x∗ ∈ Rn is a global solution (global minimum) of (Punc) if ∀x ∈ Rn, f(x∗) ≤ f(x).
▶ We say that x∗ ∈ Rn is a local solution (local minimum) of (Punc) if ∃r > 0, ∀x ∈ B(x∗, r), f(x∗) ≤ f(x).
▶ We say that x∗ ∈ Rn is a strict local solution (strict local minimum) of (Punc) if there exists r > 0 such that f(x∗) < f(x), ∀x ∈ B(x∗, r)\{x∗}.
Example
[Figure: 1-D graph of f(x) showing a local minimum and the global minimum.]
Necessary Optimality Conditions
Remarks
▶ An element x∗ ∈ Rn satisfying ∇f (x∗ ) = 0 is called a stationary
point or a critical point.
▶ Condition (1) is necessary but not sufficient, e.g. n = 1, f(x) = −x², f′(x) = −2x.
In this case, x∗ = 0 is a stationary point (f′(0) = 0) but a global maximum.
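This remark is easy to check numerically. A minimal sketch (our own Python, not part of the handout): a finite-difference derivative confirms that x∗ = 0 is stationary for f(x) = −x², while nearby values are strictly smaller, so the point is a maximum:

```python
def f(x):
    return -x**2

def fprime(x, h=1e-6):
    # Central finite-difference approximation of f'(x)
    return (f(x + h) - f(x - h)) / (2 * h)

# x* = 0 is stationary: the derivative vanishes there...
assert abs(fprime(0.0)) < 1e-8
# ...but nearby values are strictly smaller, so 0 is a maximum, not a minimum.
assert f(0.1) < f(0.0) and f(-0.1) < f(0.0)
```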
Necessary Optimality Conditions: proof
First-Order Necessary Optimality Condition: Proof
▶ Taylor expansion at x∗ for h ∈ Rn:
f(x∗ + h) = f(x∗) + ∇f(x∗)T h + ||h||ϵ(h)
For d ∈ Rn, t > 0 and h = td:
f(x∗ + td) = f(x∗) + t∇f(x∗)T d + t||d||ϵ(td)
▶ For t small, x∗ + td is close to x∗, thus f(x∗) ≤ f(x∗ + td) as x∗ is a local minimum:
f(x∗) ≤ f(x∗) + t∇f(x∗)T d + t||d||ϵ(td)
Dividing by t > 0 gives:
0 ≤ ∇f(x∗)T d + ||d||ϵ(td), where ||d||ϵ(td) → 0 as t → 0.
▶ Letting t → 0 yields ∇f(x∗)T d ≥ 0 for every d ∈ Rn. Applying this to both d and −d gives ∇f(x∗)T d = 0 for all d, hence ∇f(x∗) = 0.
Remark
Taylor expansion at x∗ for h ∈ Rn
f (x∗ + h) = f (x∗ ) + ∇f (x∗ )T h + ||h||ϵ(h)
If ∇f(x∗) = 0 (stationary point), then
f (x∗ + h) = f (x∗ ) + ||h||ϵ(h)
f (x∗ + h) ≈ f (x∗ )
Therefore:
▶ f is approximately constant around x∗
▶ The graph of f is ”flat” around x∗
▶ Question : do we have
▶ f (x∗ + h) > f (x∗ ) for any h ?
▶ f (x∗ + h) < f (x∗ ) for any h ?
▶ or something else ?
Example of stationary point in 2D: Minimum
[Figure: surface plot] Graph of f1(x, y) = 3x² + 2.5y² with curves y = 0 (blue) and x = 0 (red)
Example of stationary point in 2D: Maximum
[Figure: surface plot] Graph of f2(x, y) = −3x² − 2.5y² with curves y = 0 (blue) and x = 0 (red)
Example of stationary point in 2D: Saddle point
[Figure: surface plot] Graph of f3(x, y) = 3x² − 2.5y² with curves y = 0 (blue) and x = 0 (red)
Example of stationary point in 2D: other
[Figure: surface plot] Graph of f4(x, y) = 3x² − 2.5y³ with curves y = 0 (blue) and x = 0 (red)
Examples of stationary points: minimum and maximum
Set x∗ = (x∗, y∗) = (0, 0).
Stationary point and minimum
f1(x, y) = 3x² + 2.5y²,  ∇f1(x, y) = (6x, 5y)T
∇f1(x∗, y∗) = (0, 0)T ⇒ x∗ is a stationary point and a (global) minimum
Stationary point and maximum
f2(x, y) = −3x² − 2.5y²,  ∇f2(x, y) = (−6x, −5y)T
∇f2(x∗, y∗) = (0, 0)T ⇒ x∗ is a stationary point and a (global) maximum
Examples of stationary points: saddle point and other
Set x∗ = (x∗, y∗) = (0, 0).
Stationary point and a saddle point
f3(x, y) = 3x² − 2.5y²,  ∇f3(x, y) = (6x, −5y)T
∇f3(x∗, y∗) = (0, 0)T ⇒ x∗ is a stationary point and a saddle point
Other stationary point
f4(x, y) = 3x² − 2.5y³,  ∇f4(x, y) = (6x, −7.5y²)T
∇f4(x∗, y∗) = (0, 0)T ⇒ x∗ is another type of stationary point
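The four gradients above can be verified numerically. A sketch with NumPy (the helper names fs and grad are ours, not from the handout): central finite differences confirm that (0, 0) is stationary for all four examples:

```python
import numpy as np

# The four example functions from the slides
fs = {
    "f1": lambda x, y: 3*x**2 + 2.5*y**2,
    "f2": lambda x, y: -3*x**2 - 2.5*y**2,
    "f3": lambda x, y: 3*x**2 - 2.5*y**2,
    "f4": lambda x, y: 3*x**2 - 2.5*y**3,
}

def grad(f, x, y, h=1e-6):
    """Central finite-difference gradient of f at (x, y)."""
    return np.array([
        (f(x + h, y) - f(x - h, y)) / (2 * h),
        (f(x, y + h) - f(x, y - h)) / (2 * h),
    ])

for name, f in fs.items():
    # Each gradient vanishes at the origin: (0, 0) is stationary for all four.
    assert np.allclose(grad(f, 0.0, 0.0), 0.0, atol=1e-8)
```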
Necessary optimality conditions
Remarks
Condition (2) is necessary but still not sufficient, e.g.:
▶ n = 1, f(x) = −x⁴, f′(x) = −4x³, f″(x) = −12x²
▶ For x∗ = 0 one gets f′(x∗) = f′(0) = 0 and f″(x∗) = f″(0) = 0
▶ In this case, x∗ = 0 satisfies (2) but it is a global maximum.
Sufficient optimality conditions
Second-order sufficient optimality condition
Let x∗ ∈ Rn . If f ∈ C 2 (Rn , R) and x∗ satisfies:
∇f (x∗ ) = 0 and ∇2 f (x∗ ) ≻ 0,
then x∗ is a strict local solution of (Punc ).
Sketch of proof
▶ Taylor expansion (of order 2) at x∗ for h ∈ Rn:
f(x∗ + h) = f(x∗) + ∇f(x∗)T h + (1/2) hT ∇²f(x∗) h + ||h||² ϵ(h)
▶ We assumed ∇f(x∗) = 0, so we get:
f(x∗ + h) = f(x∗) + (1/2) hT ∇²f(x∗) h + ||h||² ϵ(h),
where (1/2) hT ∇²f(x∗) h > 0 for h ≠ 0 (as ∇²f(x∗) ≻ 0) and ||h||² ϵ(h) is negligible for h ≈ 0. Hence f(x∗ + h) > f(x∗) for all sufficiently small h ≠ 0.
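A common numerical way to check the condition ∇²f(x∗) ≻ 0 is via the eigenvalues of the Hessian, or via a Cholesky factorization, which succeeds exactly for positive definite matrices. A NumPy sketch using f1(x, y) = 3x² + 2.5y² from the earlier slides:

```python
import numpy as np

# Hessian of f1(x, y) = 3x^2 + 2.5y^2 (constant, since f1 is quadratic)
H = np.array([[6.0, 0.0],
              [0.0, 5.0]])

# A symmetric matrix is positive definite iff all its eigenvalues are > 0.
eigvals = np.linalg.eigvalsh(H)
assert np.all(eigvals > 0)

# Equivalent test: Cholesky succeeds only for positive definite matrices.
np.linalg.cholesky(H)  # raises LinAlgError if H is not positive definite
```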
Examples of stationary points: minimum and maximum
Set x∗ = (x∗, y∗) = (0, 0).
Stationary point and minimum: f1(x, y) = 3x² + 2.5y²
∇f1(x∗, y∗) = (0, 0)T and ∇²f1(x∗, y∗) = [6 0; 0 5] ≻ 0
[Figure: surface plot of f1]
Stationary point and maximum: f2(x, y) = −3x² − 2.5y²
∇f2(x∗, y∗) = (0, 0)T and ∇²f2(x∗, y∗) = [−6 0; 0 −5] ≺ 0
[Figure: surface plot of f2]
Examples of stationary points: saddle point and other
Set x∗ = (x∗, y∗) = (0, 0).
Stationary point and a saddle point: f3(x, y) = 3x² − 2.5y²
∇f3(x∗, y∗) = (0, 0)T and ∇²f3(x∗, y∗) = [6 0; 0 −5], which is neither ⪰ 0 nor ⪯ 0 (indefinite)
[Figure: surface plot of f3]
Other stationary point: f4(x, y) = 3x² − 2.5y³
∇f4(x∗, y∗) = (0, 0)T and ∇²f4(x, y) = [6 0; 0 −15y], so ∇²f4(x∗, y∗) = [6 0; 0 0] ⪰ 0
[Figure: surface plot of f4]
Classification of stationary points using the Hessian: ∇2 f (x∗ )
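The classification rule can be sketched numerically (NumPy; the helper name classify is ours): with H = ∇²f(x∗), all eigenvalues positive gives a strict local minimum, all negative a strict local maximum, mixed signs a saddle point, and a zero eigenvalue leaves the second-order test inconclusive.

```python
import numpy as np

def classify(H, tol=1e-10):
    """Classify a stationary point from its Hessian H = Hess f(x*)."""
    lam = np.linalg.eigvalsh(H)          # eigenvalues of the symmetric Hessian
    if np.all(lam > tol):
        return "strict local minimum"    # H positive definite
    if np.all(lam < -tol):
        return "strict local maximum"    # H negative definite
    if np.any(lam > tol) and np.any(lam < -tol):
        return "saddle point"            # H indefinite
    return "inconclusive"                # some eigenvalue is (near) zero

print(classify(np.diag([6.0, 5.0])))     # Hessian of f1 at the origin
print(classify(np.diag([-6.0, -5.0])))   # Hessian of f2 at the origin
print(classify(np.diag([6.0, -5.0])))    # Hessian of f3 at the origin
print(classify(np.diag([6.0, 0.0])))     # Hessian of f4 at the origin
```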
What happens if λi = 0 for some 1 ≤ i ≤ n?
[Figure: surface plots of f5, f6 and f7]
f5(x, y) = 3x⁴ + 2.5y⁴,  f6(x, y) = −3x⁴ − 2.5y⁴,  f7(x, y) = 3x⁴ − 2.5y⁴
∇f5(0, 0) = ∇f6(0, 0) = ∇f7(0, 0) = (0, 0)T
∇²f5(0, 0) = ∇²f6(0, 0) = ∇²f7(0, 0) = [0 0; 0 0] ⪰ 0
Yet (0, 0) is a global minimum of f5, a global maximum of f6 and a saddle point of f7: when some eigenvalue of the Hessian is zero, second-order information alone cannot classify the stationary point.
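A quick numerical illustration (our own sketch): all three functions have the same zero Hessian at the origin, yet sampling values nearby reveals three different behaviours.

```python
# f5, f6, f7 all have gradient (0, 0) and a zero Hessian at the origin...
f5 = lambda x, y: 3*x**4 + 2.5*y**4
f6 = lambda x, y: -3*x**4 - 2.5*y**4
f7 = lambda x, y: 3*x**4 - 2.5*y**4

# ...yet values near (0, 0) show three different behaviours:
t = 0.1
assert f5(t, t) > f5(0, 0) and f5(-t, -t) > f5(0, 0)   # minimum
assert f6(t, t) < f6(0, 0) and f6(-t, -t) < f6(0, 0)   # maximum
assert f7(t, 0) > f7(0, 0) and f7(0, t) < f7(0, 0)     # saddle point
```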
How to compute and classify stationary points analytically?
Sufficient optimality conditions in the convex case
Unconstrained convex quadratic optimization
min over x ∈ Rn of f(x) = (1/2) xT Sx − cT x,  (Pquad)
where S ∈ Rn×n is symmetric and c ∈ Rn.
Properties
▶ ∀x ∈ Rn , ∇f (x) = Sx − c and ∇2 f (x) = S.
▶ f is convex ⇐⇒ S ⪰ 0.
▶ Let x∗ ∈ Rn . If S ⪰ 0 then:
x∗ ∈ Rn is a global solution of (Pquad ) ⇐⇒ Sx∗ = c.
▶ If S ≻ 0 then (Pquad ) has a unique global solution x∗ = S −1 c.
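For S ≻ 0 the unique global solution satisfies the linear system Sx∗ = c, so in practice one calls a linear solver rather than forming S⁻¹. A NumPy sketch (S and c are made-up example data):

```python
import numpy as np

# Made-up example data with S symmetric positive definite
S = np.array([[4.0, 1.0],
              [1.0, 3.0]])
c = np.array([1.0, 2.0])

# Unique global minimizer: solve S x* = c (preferred over inverting S)
x_star = np.linalg.solve(S, c)

# x* is stationary: the gradient Sx - c vanishes there
assert np.allclose(S @ x_star - c, 0.0)

# f(x*) is no larger than f at random nearby points
f = lambda x: 0.5 * x @ S @ x - c @ x
rng = np.random.default_rng(0)
for _ in range(5):
    assert f(x_star) <= f(x_star + rng.normal(size=2))
```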
Linear least squares
Let R ∈ Mn×p(R) and y ∈ Rn. The linear least squares problem is defined as:
min over x ∈ Rp of f(x) = (1/2) ||Rx − y||².  (Pls)
Properties
▶ (Pls) is a special case of (Pquad) with S = RT R and c = RT y.
▶ ∀x ∈ Rp, ∇f(x) = RT Rx − RT y and ∇²f(x) = RT R ⪰ 0.
▶ x∗ ∈ Rp is a global solution of (Pls) ⇐⇒ RT Rx∗ = RT y (the normal equations).
▶ If R has full column rank, then (Pls) has a unique global solution x∗ = (RT R)−1 RT y.
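A NumPy sketch (R and y are made-up data): np.linalg.lstsq solves the least-squares problem with an SVD-based method, which is numerically safer than explicitly forming RT R, and its answer matches the normal-equations solution here:

```python
import numpy as np

# Made-up overdetermined example: n = 5 observations, p = 2 unknowns
rng = np.random.default_rng(0)
R = rng.normal(size=(5, 2))   # full column rank with probability 1
y = rng.normal(size=5)

# Solution via the normal equations R^T R x = R^T y
x_normal = np.linalg.solve(R.T @ R, R.T @ y)

# Solution via a dedicated least-squares solver (numerically preferable)
x_lstsq, *_ = np.linalg.lstsq(R, y, rcond=None)

assert np.allclose(x_normal, x_lstsq)
```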
Numerical Optimization
Convergence of Iterates
xk → x∗ as k → +∞, where x∗ ∈ argmin over x ∈ Rn of f(x)
Convergence of Costs of Iterates to the Optimal Value
f(xk) → f∗ as k → +∞, where f∗ = min over x ∈ Rn of f(x)
Convergence to a Stationary Point
∇f(xk) → 0 as k → +∞, if f is differentiable
In practice:
▶ We do not know the optimal solution(s) x∗;
▶ We do not know the optimal value f∗ = f(x∗).
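All three convergence notions can be observed on a small gradient-descent run (our own sketch, not part of the handout; the quadratic and the fixed step size are made-up choices):

```python
import numpy as np

# Minimize f(x) = 0.5 x^T S x - c^T x with plain gradient descent
S = np.array([[4.0, 1.0], [1.0, 3.0]])
c = np.array([1.0, 2.0])
x_star = np.linalg.solve(S, c)       # known solution, for comparison only

f = lambda x: 0.5 * x @ S @ x - c @ x
grad = lambda x: S @ x - c

x = np.zeros(2)
alpha = 0.1                          # fixed step size (must be < 2 / lambda_max(S))
for k in range(200):
    x = x - alpha * grad(x)

# All three convergence notions hold on this example:
assert np.allclose(x, x_star, atol=1e-6)     # iterates -> x*
assert abs(f(x) - f(x_star)) < 1e-10         # costs -> f*
assert np.linalg.norm(grad(x)) < 1e-6        # gradients -> 0
```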
Property
If d ∈ Rn \{0} is a descent direction of f at x, then there exists ᾱ > 0
such that ∀α ∈ (0, ᾱ]:
f (x + αd) < f (x). (3)
Remark
From the Taylor expansion we have
f(x + αd) = f(x) + α(∇f(x)T d + ϵ(αd)),
where f : Rn → R, d ∈ Rn, α > 0, and ϵ(αd) → 0 as α → 0. Since ∇f(x)T d < 0 for a descent direction d, the term in parentheses is negative for α small enough, which yields f(x + αd) < f(x).
Descent Directions
Examples:
▶ d = −∇f (x) (steepest descent)
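For instance (a NumPy sketch on a made-up smooth function, not from the handout), d = −∇f(x) satisfies ∇f(x)T d < 0 and the defining inequality f(x + αd) < f(x) for small α > 0:

```python
import numpy as np

f = lambda x: 0.5 * x @ x + np.sin(x[0])            # made-up smooth test function
grad = lambda x: x + np.array([np.cos(x[0]), 0.0])  # its gradient

x = np.array([1.0, 2.0])
d = -grad(x)                  # steepest-descent direction at x

# d is a descent direction: directional derivative is negative...
assert grad(x) @ d < 0
# ...so small steps along d strictly decrease f
for alpha in [1e-1, 1e-2, 1e-3]:
    assert f(x + alpha * d) < f(x)
```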