
Optimization Methods

Lecture 2

Solmaz S. Kia
Mechanical and Aerospace Engineering Dept.
University of California Irvine
[email protected]

Reading: Sections 7.1-7.5, 8.6, 8.8 of Ref[2].

Unconstrained optimization

x* = argmin_{x ∈ ℝⁿ} f(x)

x* ∈ ℝⁿ is an unconstrained local minimum of f if

∃ ε > 0 s.t. f(x*) ≤ f(x), ∀x with ‖x − x*‖ < ε.

x* ∈ ℝⁿ is an unconstrained global minimum of f if

f(x*) ≤ f(x), ∀x ∈ ℝⁿ.

x* ∈ ℝⁿ is an unconstrained strict local minimum of f if

∃ ε > 0 s.t. f(x*) < f(x), ∀x with ‖x − x*‖ < ε.

x* ∈ ℝⁿ is an unconstrained strict global minimum of f if

f(x*) < f(x), ∀x ∈ ℝⁿ.

Necessary conditions for optimality

OPT:  x* = argmin f(x)  subject to  x ∈ X   (X is the constraint set);
for X = ℝⁿ the problem becomes unconstrained.

d ∈ ℝⁿ is a feasible direction at x ∈ X for OPT if (x + αd) ∈ X for some ᾱ > 0 and all α ∈ [0, ᾱ].

Proposition:
First-order necessary condition (FONC): consider OPT and let f ∈ C¹. If x* is a local minimizer of f, then
∇f(x*)ᵀd ≥ 0 for every feasible direction d ∈ ℝⁿ at x*.
Second-order necessary condition (SONC): let f ∈ C². If x* is a local minimizer of f, then for every feasible direction d ∈ ℝⁿ at x*:
(i) ∇f(x*)ᵀd ≥ 0,
(ii) if ∇f(x*) = 0, then dᵀ∇²f(x*)d ≥ 0.
Necessary conditions for optimality

x* = argmin_{x ∈ ℝⁿ} f(x)

Proposition (necessary optimality conditions)


Let x* be an unconstrained local minimum of f : ℝⁿ → ℝ and assume that f is
continuously differentiable in an open set S containing x*. Then

∇f(x*) = 0. (First-Order Necessary Condition)

If in addition f is twice continuously differentiable within S, then

∇²f(x*) is positive semidefinite. (Second-Order Necessary Condition)

Proof: see page 13-14 of Ref[1].

Stationary point: any point x̄ ∈ ℝⁿ that satisfies ∇f(x̄) = 0 is called a stationary
point. A stationary point can be a minimum, maximum, or saddle point of the cost
function f.
Sufficient conditions for optimality

x* = argmin_{x ∈ ℝⁿ} f(x)

Proposition (Second order sufficient optimality conditions)


Let f : ℝⁿ → ℝ be twice continuously differentiable in an open set S. Suppose
that a vector x* satisfies the conditions

∇f(x*) = 0,  ∇²f(x*) positive definite.

Then x* is a strict unconstrained local minimum of f. In particular, there exist
scalars γ > 0 and ε > 0 such that

f(x) ≥ f(x*) + (γ/2)‖x − x*‖²,  ∀x with ‖x − x*‖ < ε.

Proof: see page 15 of Ref[1].
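
As an illustration (not from the slides), these conditions can be checked numerically for a given twice-differentiable f: evaluate the gradient at a candidate point and test the eigenvalues of the Hessian. A minimal Python/NumPy sketch, using an assumed quadratic example:

```python
import numpy as np

# assumed example: f(x) = x1^2 + 2*x2^2, whose unique minimizer is the origin
def grad_f(x):
    return np.array([2.0 * x[0], 4.0 * x[1]])

def hess_f(x):
    return np.array([[2.0, 0.0],
                     [0.0, 4.0]])

x_star = np.array([0.0, 0.0])          # candidate point

# First-order necessary condition: gradient (numerically) zero
fonc = np.linalg.norm(grad_f(x_star)) < 1e-8

# Second-order sufficient condition: Hessian positive definite (all eigenvalues > 0)
eigvals = np.linalg.eigvalsh(hess_f(x_star))
sosc = bool(np.all(eigvals > 0))

print("FONC:", fonc, "| Hessian eigenvalues:", eigvals, "| SOSC:", sosc)
```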

Stationary points: example

f(x) = x³:
∇f(x) = 3x²;  stationary point: ∇f(0) = 0;  x* = 0 is an inflection point.
∇²f(x) = 6x, so ∇²f(0) = 0.

f(x) = |x|³:
∇f(x) = 3x² for x ≥ 0 and −3x² for x < 0;  stationary point: ∇f(0) = 0;  x* = 0 is a local minimizer.
∇²f(x) = 6|x|, so ∇²f(0) = 0.

f(x) = −|x|³:
∇f(x) = −3x² for x ≥ 0 and 3x² for x < 0;  stationary point: ∇f(0) = 0;  x* = 0 is a local maximizer.
∇²f(x) = −6|x|, so ∇²f(0) = 0.
Note that in all three cases x* = 0 satisfies the FONC and SONC, but satisfying the necessary
conditions does not mean that these points are minimizers. Note also that x* does not satisfy the
second-order sufficient conditions (∇²f(0) = 0 is not positive definite).
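
To make this concrete in code (an illustration, not part of the slides), the f(x) = x³ case can be checked numerically: both necessary conditions hold at x* = 0, yet 0 is not a minimizer.

```python
import numpy as np

f = lambda x: x**3
df = lambda x: 3.0 * x**2     # first derivative (gradient in 1D)
d2f = lambda x: 6.0 * x       # second derivative (Hessian in 1D)

x_star = 0.0
print("FONC holds:", np.isclose(df(x_star), 0.0))   # gradient vanishes at 0
print("SONC holds:", d2f(x_star) >= 0.0)            # second derivative is >= 0 at 0
# ...but x* = 0 is not a local minimizer: f is smaller just to the left of 0
print("f(-1e-3) < f(0):", f(-1e-3) < f(x_star))
```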
Singular and non-singular local minimum

A local minimum point that does not satisfy the sufficiency conditions
∇f(x*) = 0, ∇²f(x*) positive definite, is called singular; otherwise it is called nonsingular.
Singular local minima are harder to deal with:
In the absence of convexity of f, their optimality cannot be ascertained using
easily verifiable sufficient conditions.
In their neighborhood, the behavior of most commonly used optimization
algorithms tends to be slow and/or erratic.

Convex sets and convex functions (see Appendix B of Ref[1])

Convex set Ω: the line segment connecting any two points p, q ∈ Ω belongs to Ω:

∀p, q ∈ Ω : (t p + (1 − t) q) ∈ Ω for all t ∈ [0, 1].

Convex function: f is convex over a convex set Ω iff

f(t x1 + (1 − t) x2) ≤ t f(x1) + (1 − t) f(x2), ∀x1, x2 ∈ Ω and t ∈ [0, 1].

Convex function

Convex function: f is convex over a convex set Ω iff

f(t x1 + (1 − t) x2) ≤ t f(x1) + (1 − t) f(x2), ∀x1, x2 ∈ Ω and t ∈ [0, 1].

When f is differentiable, it is convex over the convex set Ω iff

f(x) ≥ f(x0) + ∇f(x0)ᵀ(x − x0), ∀x0, x ∈ Ω.

When f is twice differentiable, it is convex over the convex set Ω iff

∇²f(x) ⪰ 0 (positive semidefinite), ∀x ∈ Ω.
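
As an aside (not on the slides), the first-order characterization can be spot-checked numerically for a candidate convex function by sampling pairs of points; a minimal sketch with an assumed example f(x) = ‖x‖² + e^x1:

```python
import numpy as np

# assumed convex example: f(x) = ||x||^2 + exp(x1)
f = lambda x: np.dot(x, x) + np.exp(x[0])
grad_f = lambda x: 2.0 * x + np.array([np.exp(x[0]), 0.0])

rng = np.random.default_rng(0)
holds = True
for _ in range(1000):
    x0, x = rng.normal(size=2), rng.normal(size=2)
    # first-order condition: f(x) >= f(x0) + grad_f(x0)^T (x - x0)
    holds &= bool(f(x) >= f(x0) + grad_f(x0) @ (x - x0) - 1e-12)
print("First-order convexity inequality held on all samples:", holds)
```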
Optimality conditions for convex functions

Proposition (Optimality conditions for convex functions)


Let f : X → ℝ be a convex function over the convex set X.
(a) A local minimum of f over X is also a global minimum over X. If in addition
f is strictly convex, then there exists at most one global minimum of f.
(b) If f is convex and the set X is open, then ∇f(x*) = 0 is a necessary and
sufficient condition for a vector x* ∈ X to be a global minimum of f over X.

Proof: see page 14 of Ref[1].

For part (a) use f(αx* + (1 − α)x̄) ≤ αf(x*) + (1 − α)f(x̄).

For part (b) use f(x) ≥ f(x*) + ∇f(x*)ᵀ(x − x*), ∀x ∈ X.

Numerical solvers (see Section 1.2 of Ref[1])

Iterative descent methods


start from x0 ∈ ℝⁿ (initial guess)

successively generate vectors x1, x2, ... such that

f(xk+1) < f(xk),  k = 0, 1, 2, ...

xk+1 = xk + αk dk

Design factors in iterative descent algorithms:
what direction to move in: the descent direction dk
how far to move in that direction: the step size αk
Successive descent method

xk+1 = xk + αk dk

1st-order Taylor series: f(xk+1) = f(xk + αk dk) ≈ f(xk) + αk ∇f(xk)ᵀdk

for successive reduction: αk ∇f(xk)ᵀdk < 0

If ∇f(xk) ≠ 0:

90° < ∠(dk, ∇f(xk)) < 270°  ⇒  ∇f(xk)ᵀdk < 0

by an appropriate choice of step size αk we can achieve f(xk+1) < f(xk)

The observations above lead to a family of gradient-based algorithms.

Steepest descent method

xk+1 = xk + αk dk

1st-order Taylor series: f(xk+1) = f(xk + αk dk) ≈ f(xk) + αk ∇f(xk)ᵀdk

for successive reduction: αk ∇f(xk)ᵀdk < 0

dk = −∇f(xk) :  −∇f(xk)ᵀ∇f(xk) < 0 whenever ∇f(xk) ≠ 0

Proposition: dk = −∇f(xk) is a descent direction, i.e., f(xk + αk dk) < f(xk) for
all sufficiently small values of αk > 0.

Steepest Descent Algorithm
Step 0. Given x0, set k := 0.
Step 1. dk := −∇f(xk). If dk = 0, then stop.
Step 2. Solve αk = argmin_α f(xk + α dk) for the step size αk (chosen by an
exact or inexact line search).
Step 3. Set xk+1 ← xk + αk dk, k ← k + 1. Go to Step 1.

Note: from Step 2 and the fact that dk = −∇f(xk) is a descent direction it
follows that f(xk+1) < f(xk).
Steepest descent method
The steepest descent method can have slow convergence. Two example cost functions
(level sets and steepest descent iterates are shown in the slide figures):

Rosenbrock function: f(x1, x2) = 100(x2 − x1²)² + (1 − x1)²,
with starting point x0 = (−1.2, 1.0)ᵀ and minimizer x* = (1, 1)ᵀ

f(x1, x2) = 1 − e^−(10x1² + x2²)

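As an illustration (not part of the slides), here is a minimal NumPy sketch of the steepest descent algorithm above, using a backtracking (Armijo) rule as the inexact line search and the Rosenbrock function with the starting point quoted above; the large iteration count it reports reflects the slow convergence.

```python
import numpy as np

def rosenbrock(x):
    return 100.0 * (x[1] - x[0]**2)**2 + (1.0 - x[0])**2

def rosenbrock_grad(x):
    return np.array([
        -400.0 * x[0] * (x[1] - x[0]**2) - 2.0 * (1.0 - x[0]),
        200.0 * (x[1] - x[0]**2),
    ])

def steepest_descent(f, grad, x0, tol=1e-5, max_iter=50_000):
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:             # stop near a stationary point
            return x, k
        d = -g                                  # Step 1: steepest descent direction
        alpha, beta, sigma = 1.0, 0.5, 1e-4     # Step 2: backtracking (Armijo) line search
        while f(x + alpha * d) > f(x) + sigma * alpha * (g @ d):
            alpha *= beta
        x = x + alpha * d                       # Step 3: update the iterate
    return x, max_iter                          # may hit max_iter before the tolerance is met

x_final, iters = steepest_descent(rosenbrock, rosenbrock_grad, [-1.2, 1.0])
print("final iterate:", x_final, "after", iters, "iterations")  # slow progress toward (1, 1)
```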
Newton’s method

xk+1 = xk + αk dk,  with ∆xk = αk dk

2nd-order Taylor series:

f(xk+1) = f(xk + ∆xk) ≈ h(∆xk) = f(xk) + ∇f(xk)ᵀ∆xk + ½ ∆xkᵀ∇²f(xk)∆xk

For successive reduction: find ∆xk by minimizing h(∆xk) over ∆xk:

∇h(∆xk) = 0 ⇒ ∇²f(xk)∆xk + ∇f(xk) = 0 ⇒ ∆xk = −(∇²f(xk))⁻¹∇f(xk)

xk+1 = xk − (∇²f(xk))⁻¹∇f(xk)

Newton's method
Step 0. Given x0, set k := 0.
Step 1. dk := −(∇²f(xk))⁻¹∇f(xk). If dk = 0, then stop.
Step 2. Set αk = 1.
Step 3. Set xk+1 ← xk + αk dk, k ← k + 1. Go to Step 1.
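
A minimal sketch of the pure Newton iteration above (illustration only, reusing the Rosenbrock problem as an assumed example); the Newton direction is obtained by solving ∇²f(xk) dk = −∇f(xk) rather than forming the inverse explicitly.

```python
import numpy as np

def rosenbrock_grad(x):
    return np.array([
        -400.0 * x[0] * (x[1] - x[0]**2) - 2.0 * (1.0 - x[0]),
        200.0 * (x[1] - x[0]**2),
    ])

def rosenbrock_hess(x):
    return np.array([
        [1200.0 * x[0]**2 - 400.0 * x[1] + 2.0, -400.0 * x[0]],
        [-400.0 * x[0], 200.0],
    ])

def newton(grad, hess, x0, tol=1e-10, max_iter=100):
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            return x, k
        d = np.linalg.solve(hess(x), -g)   # Newton direction: solve H d = -g
        x = x + d                          # alpha_k = 1 (pure Newton step)
    return x, max_iter

x_opt, iters = newton(rosenbrock_grad, rosenbrock_hess, [-1.2, 1.0])
print("x* ≈", x_opt, "after", iters, "iterations")  # far fewer iterations than steepest descent
```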
Modified Newton’s method
2nd-order Taylor series:

f(xk+1) = f(xk + ∆xk) ≈ h(∆xk) = f(xk) + ∇f(xk)ᵀ∆xk + ½ ∆xkᵀ∇²f(xk)∆xk

xk+1 = xk − (∇²f(xk))⁻¹∇f(xk)

Note the following:
f(xk+1) < f(xk) is not necessarily guaranteed.
The algorithm can be modified to xk+1 = xk − αk (∇²f(xk))⁻¹∇f(xk), and Step 2 should then be modified to:
Step 2. Solve αk = argmin_α f(xk − α (∇²f(xk))⁻¹∇f(xk)) for the step size αk
(chosen by an exact or inexact line search).

Proposition: If H(xk) = ∇²f(xk) is a symmetric positive definite matrix, then
dk := −H(xk)⁻¹∇f(xk) is a descent direction, i.e., f(xk + αk dk) < f(xk) for all
sufficiently small values of αk > 0.

Proof: for dk to be a descent direction we should show that ∇f(xk)ᵀdk < 0.
Here ∇f(xk)ᵀdk = −∇f(xk)ᵀH(xk)⁻¹∇f(xk). Because H(xk) is positive
definite, it follows that ∇f(xk)ᵀdk = −∇f(xk)ᵀH(xk)⁻¹∇f(xk) < 0, where we
used the fact that if a matrix is positive definite, its inverse is also positive definite.
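
A sketch of the modified (damped) Newton update, combining the Newton direction with a backtracking line search on αk (the backtracking rule is an assumed choice of inexact line search; the handles f, grad, hess are placeholders supplied by the user):

```python
import numpy as np

def damped_newton(f, grad, hess, x0, tol=1e-8, max_iter=200):
    # Newton direction with a backtracking line search on the step size alpha
    x = np.asarray(x0, dtype=float)
    for k in range(max_iter):
        g = grad(x)
        if np.linalg.norm(g) < tol:
            return x, k
        d = np.linalg.solve(hess(x), -g)   # Newton direction; assumes hess(x) is positive
                                           # definite, so d is a descent direction (see above)
        alpha, beta, sigma = 1.0, 0.5, 1e-4
        while f(x + alpha * d) > f(x) + sigma * alpha * (g @ d) and alpha > 1e-12:
            alpha *= beta                  # shrink alpha until f decreases sufficiently
        x = x + alpha * d
    return x, max_iter
```

It can be run, for example, on the Rosenbrock function defined in the earlier steepest descent sketch.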
Newton and modified Newton methods

Newton's method typically converges very fast asymptotically.

It does not exhibit the zig-zagging behavior of steepest descent.

On the downside: Newton's method needs to compute not only the gradient but
also the Hessian, which contains n(n + 1)/2 second-order derivatives (numerically
expensive).

Example: f(x1, x2) = 1 − e^−(10x1² + x2²) (level sets and iterates shown in the slide figure)

Practical Stopping Conditions for Iterative Optimization Algorithms for
Unconstrained Optimization

In iterative algorithms the initial point is typically picked randomly, or, if we have a
guess for the location of a local minimum, we pick a point close to it.

Stopping criteria: the stopping condition is related to the first-order optimality
condition ∇f(x) = 0. The following are common practical stopping conditions
for iterative unconstrained optimization algorithms. Let ε > 0:

‖∇f(xk)‖ ≤ ε
Close to satisfying the first-order necessary condition ∇f(x) = 0.

|f(xk+1) − f(xk)| ≤ ε
Improvements in the function value are saturating.

‖xk+1 − xk‖ ≤ ε
Movement between iterates has become small.

|f(xk+1) − f(xk)| / max{1, |f(xk)|} ≤ ε
A "relative" measure: removes dependence on the scale of f.
The max is taken to avoid dividing by small numbers.

‖xk+1 − xk‖ / max{1, ‖xk‖} ≤ ε
A "relative" measure: removes dependence on the scale of xk.
The max is taken to avoid dividing by small numbers.
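
These tests are easy to combine in code; a minimal sketch (the function name, argument names, and tolerance are illustrative assumptions):

```python
import numpy as np

def should_stop(x_prev, x_new, f_prev, f_new, grad_new, eps=1e-6):
    # Return True if any of the common practical stopping tests is met
    grad_small    = np.linalg.norm(grad_new) <= eps                     # ||grad f(xk)|| <= eps
    f_stalled     = abs(f_new - f_prev) <= eps                          # absolute change in f
    x_stalled     = np.linalg.norm(x_new - x_prev) <= eps               # absolute change in x
    f_rel_stalled = abs(f_new - f_prev) / max(1.0, abs(f_prev)) <= eps  # relative change in f
    x_rel_stalled = (np.linalg.norm(x_new - x_prev)
                     / max(1.0, np.linalg.norm(x_prev)) <= eps)         # relative change in x
    return grad_small or f_stalled or x_stalled or f_rel_stalled or x_rel_stalled
```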
References

[1] Nonlinear Programming: 3rd Edition, by D. P. Bertsekas

[2] Linear and Nonlinear Programming, by D. G. Luenberger, Y. Ye
