
Lecture 10. Some Algorithms to Solve Unconstrained Optimization Problems

Conditions for Local Maximizers/Minimizers
The Training Problem
Gradient Method
Newton’s Method
Conjugate Direction Methods

Conditions for Local Maximizers/Minimizers

Consider the problem of finding x ∈ D such that f(x) attains its maximum
(minimum), where f(x) is twice continuously differentiable.
This is an unconstrained optimization problem if an optimal solution x∗ is an
interior point of D, i.e. x∗ ∈ int D.
First-order necessary condition: if x∗ is a local maximizer (minimizer) of
f(x), then ∇f(x∗) = 0.
Second-order necessary condition: if x∗ is a local maximizer (minimizer) of
f(x), then the Hessian matrix H(x∗) of f(x) at x∗ is negative (positive)
semidefinite.
Sufficient condition: if ∇f(x∗) = 0 and the Hessian matrix H(x∗) is negative
(positive) definite, then f(x) attains a local maximum (minimum) at x∗.

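As a quick numerical check (a sketch added to these notes, not part of the
original slides), the conditions above can be verified for a concrete function;
the choice f(x1, x2) = x1² + x2² and the candidate point are purely illustrative.

```python
import numpy as np

def grad_f(x):
    # Gradient of f(x1, x2) = x1^2 + x2^2 (illustrative choice)
    return 2 * x

def hess_f(x):
    # Hessian of f is the constant matrix 2I
    return 2 * np.eye(2)

x_star = np.array([0.0, 0.0])           # candidate point
g = grad_f(x_star)
eigvals = np.linalg.eigvalsh(hess_f(x_star))

print("gradient:", g)                   # [0. 0.]  -> first-order condition holds
print("Hessian eigenvalues:", eigvals)  # all > 0  -> H(x*) positive definite
# ∇f(x*) = 0 and H(x*) positive definite, so x* is a local minimizer.
```
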
The Training Problem

The Machine Learner’s Job

(1) Get the labeled data (x^1, y^1), ..., (x^n, y^n).
(2) Choose a parametrization for the hypothesis: h_w(x).
(3) Choose a loss function: ℓ(h_w(x), y) ≥ 0.
(4) Solve the training problem:

    min_{w ∈ R^d}  (1/n) ∑_{i=1}^{n} ℓ(h_w(x^i), y^i) + λ R(w)

(5) Test and cross-validate. If this fails, go back a few steps.

The Training Problem

The general training problem:

    min_{w ∈ R^d}  (1/n) ∑_{i=1}^{n} ℓ(h_w(x^i), y^i) + λ R(w)

∑_{i=1}^{n} ℓ(h_w(x^i), y^i): goodness of fit
λ: controls the tradeoff between fit and complexity
λ R(w): penalizes complexity
R(w) = ∥w∥₂², ∥w∥₁, ∥w∥_p, ...

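To make the objective concrete, here is a minimal sketch (an illustration added
to these notes) of one instance of the training problem, with squared loss, the
linear hypothesis h_w(x) = wᵀx, and R(w) = ∥w∥₂²; the toy data and λ are made up.

```python
import numpy as np

def training_objective(w, X, y, lam):
    """(1/n) * sum of squared losses + lam * ||w||_2^2 (illustrative choices)."""
    residuals = X @ w - y                 # h_w(x^i) - y^i with h_w(x) = w^T x
    fit = np.mean(residuals ** 2)         # goodness of fit
    complexity = lam * np.dot(w, w)       # lambda * R(w), R(w) = ||w||_2^2
    return fit + complexity

# Toy data, purely for illustration
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=20)
print(training_objective(np.zeros(3), X, y, lam=0.1))
```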

Gradient Methods

The unconstrained problem: find x ∈ R^n such that f(x) attains its minimum,
where f(x) is a twice continuously differentiable function.
The level set of f(x): {x ∈ R^n : f(x) = c}.
The gradient of f at x^0: ∇f(x^0).
By Taylor’s theorem, we have

    f(x^0 − α∇f(x^0)) = f(x^0) − α∥∇f(x^0)∥² + o(α)

If ∇f(x^0) ≠ 0, then for sufficiently small α > 0 we obtain

    f(x^0 − α∇f(x^0)) < f(x^0)

Set x^1 := x^0 − α₀∇f(x^0); x^1 is an improvement over the point x^0.
α₀ ≥ 0 is the step size.

Gradient Methods

The steepest descent algorithm: this is a gradient algorithm in which the step
size α_k is chosen to minimize φ_k(α) = f(x^k − α∇f(x^k)).
1. Let x^0 be a starting point.
2. Assign k := 0.
3. Find ∇f(x^k). If ∇f(x^k) = 0, go to Step 7; otherwise go to the next step.
4. Find the step size α_k:

       α_k = arg min_{α ≥ 0} f(x^k − α∇f(x^k))

5. Set x^{k+1} = x^k − α_k ∇f(x^k).
6. Assign k := k + 1 and go back to Step 3.
7. Stop the algorithm and conclude that x^k is an optimal solution.
A minimal implementation of this procedure is sketched below.

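The following sketch (added to these notes) implements the algorithm above,
using scipy.optimize.minimize_scalar for the one-dimensional minimization in
Step 4; the bounded search interval [0, 10], tolerance, and iteration cap are
arbitrary illustrative choices.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def steepest_descent(f, grad_f, x0, tol=1e-8, max_iter=1000):
    """Steepest descent with an (approximate) exact line search."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:          # practical version of "∇f(x^k) = 0"
            break
        # Step 4: alpha_k = argmin_{alpha >= 0} f(x^k - alpha * ∇f(x^k))
        phi = lambda a: f(x - a * g)
        alpha = minimize_scalar(phi, bounds=(0.0, 10.0), method="bounded").x
        x = x - alpha * g                    # Step 5
    return x

# Example (a) from the Examples slide: f(x1, x2) = x1^2 + x2^2, x^0 = (1, 2)
f = lambda x: x[0]**2 + x[1]**2
grad = lambda x: np.array([2 * x[0], 2 * x[1]])
print(steepest_descent(f, grad, [1.0, 2.0]))   # approaches (0, 0)
```
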
Gradient Methods

Proposition
If {x^k}_{k=0}^∞ is a steepest descent sequence for a given function f(x) and
∇f(x^k) ≠ 0, then f(x^{k+1}) < f(x^k).

Practical stopping criteria

Let ε > 0 be a prespecified threshold; we stop the algorithm when
|f(x^{k+1}) − f(x^k)| < ε
∥x^{k+1} − x^k∥ < ε
|f(x^{k+1}) − f(x^k)| / |f(x^k)| < ε
∥x^{k+1} − x^k∥ / ∥x^k∥ < ε

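These criteria translate directly into code; the sketch below (added to these
notes) checks all four, and the small constant guarding against division by
zero is an implementation choice, not part of the slides.

```python
import numpy as np

def should_stop(x_prev, x_next, f_prev, f_next, eps=1e-6):
    """Return True if any of the practical stopping criteria is satisfied."""
    tiny = 1e-12   # guards the relative criteria against division by zero
    return (
        abs(f_next - f_prev) < eps
        or np.linalg.norm(x_next - x_prev) < eps
        or abs(f_next - f_prev) / max(abs(f_prev), tiny) < eps
        or np.linalg.norm(x_next - x_prev) / max(np.linalg.norm(x_prev), tiny) < eps
    )
```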

Gradient Methods

Examples
Solve these problems using the steepest descent algorithm:
a. f(x1, x2) = x1² + x2², starting from x^0 = (1, 2).
b. f(x1, x2) = x1²/5 + x2², starting from x^0 = (1, 2).
c. f(x1, x2) = x1 + 12x2 + 12x1² + x2² + 3, with starting point x^0 = (0, 0).
d. f(x) = 4x1² − 4x1x2 + 2x2², with starting point x^0 = (2, 3).
e. f(x) = x1² − 2x1x2 + 2x2² + 2x1, with starting point x^0 = (0, 0).
Example (b) is worked numerically in the sketch below.

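As a usage example, example (b) can be checked with the steepest_descent sketch
given after the algorithm slide above (that helper is an addition of these
notes, not lecture code):

```python
import numpy as np

# Example (b): f(x1, x2) = x1^2 / 5 + x2^2, starting from x^0 = (1, 2)
f_b = lambda x: x[0]**2 / 5 + x[1]**2
grad_b = lambda x: np.array([2 * x[0] / 5, 2 * x[1]])

print(steepest_descent(f_b, grad_b, [1.0, 2.0]))   # approaches (0, 0)
```

Because the two coordinates are scaled differently here, the iterates zigzag
toward (0, 0) instead of reaching it in a single exact-line-search step as in
example (a).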

The Method of Steepest Descent with a Quadratic Function

The quadratic function has the form

    f(x) = (1/2) xᵀQx − bᵀx,

where Q ∈ R^{n×n} is a symmetric positive definite matrix, b ∈ R^n, and x ∈ R^n.

    ∇f(x) = Qx − b

Writing d^k = ∇f(x^k), the iteration is

    x^{k+1} = x^k − α_k d^k,

where the exact line-search step size has the closed form

    α_k = arg min_{α ≥ 0} f(x^k − α d^k) = (d^kᵀ d^k) / (d^kᵀ Q d^k)

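Because the step size has this closed form, the quadratic case needs no
line-search routine. Below is a minimal sketch (added to these notes); the
matrix Q, vector b, and starting point are illustrative choices.

```python
import numpy as np

def steepest_descent_quadratic(Q, b, x0, tol=1e-10, max_iter=1000):
    """Steepest descent for f(x) = 0.5 x^T Q x - b^T x with exact step size."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        d = Q @ x - b                       # d^k = ∇f(x^k)
        if np.linalg.norm(d) < tol:
            break
        alpha = (d @ d) / (d @ Q @ d)       # closed-form exact line search
        x = x - alpha * d
    return x

# Illustrative data: the minimizer solves Qx = b
Q = np.array([[4.0, 1.0], [1.0, 3.0]])      # symmetric positive definite
b = np.array([1.0, 2.0])
print(steepest_descent_quadratic(Q, b, np.zeros(2)))
print(np.linalg.solve(Q, b))                # reference solution
```
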
Newton’s Method (Newton-Raphson Method)

The gradient descent method is a first-order method: it relies on the gradient
to improve the solution.
A first-order method is intuitive, but sometimes too slow.
A second-order method relies on the Hessian to update the solution.
We will introduce one second-order method: Newton’s method.
Let’s start with Newton’s method for solving a nonlinear equation.

Newton’s method for a nonlinear equation

Let f : R → R be differentiable. We want to find x satisfying f(x) = 0.
For any x^k, let

    f_L(x) = f(x^k) + f′(x^k)(x − x^k)

be the linear approximation of f at x^k.
We move from x^k to x^{k+1} by setting

    f_L(x^{k+1}) = 0 ⇔ f(x^k) + f′(x^k)(x^{k+1} − x^k) = 0,

i.e., x^{k+1} = x^k − f(x^k)/f′(x^k).
We keep iterating until |f(x^k)| < ε or |x^{k+1} − x^k| < ε for some
predetermined ε > 0.

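A minimal sketch of this root-finding iteration (added to these notes); the
example equation x² − 2 = 0, the starting point, and the tolerance are
illustrative choices, and f′(x^k) is assumed to be nonzero along the way.

```python
def newton_root(f, f_prime, x0, eps=1e-10, max_iter=100):
    """Newton-Raphson iteration for solving f(x) = 0."""
    x = x0
    for _ in range(max_iter):
        fx = f(x)
        if abs(fx) < eps:            # stopping criterion |f(x^k)| < eps
            break
        x_new = x - fx / f_prime(x)  # x^{k+1} = x^k - f(x^k)/f'(x^k)
        if abs(x_new - x) < eps:     # stopping criterion |x^{k+1} - x^k| < eps
            return x_new
        x = x_new
    return x

# Illustration: solve x^2 - 2 = 0 starting from x^0 = 1
print(newton_root(lambda x: x**2 - 2, lambda x: 2 * x, 1.0))   # ≈ 1.414213562...
```
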
Newton’s method for single-variate NLPs

Let f be twice differentiable. We want to find x satisfying f′(x) = 0.
For any x^k, let

    f′_L(x) = f′(x^k) + f′′(x^k)(x − x^k)

be the linear approximation of f′ at x^k.
To approach x, we move from x^k to x^{k+1} by setting

    f′_L(x^{k+1}) = 0 ⇔ f′(x^k) + f′′(x^k)(x^{k+1} − x^k) = 0

We keep iterating until |f′(x^k)| < ε or |x^{k+1} − x^k| < ε for some
predetermined ε > 0.
Note that f′(x) = 0 does not guarantee a global minimum. That is why showing
that f is convex is useful!

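A minimal sketch of this iteration (added to these notes); the test function
f(x) = x⁴ − 3x² + x, its derivatives, and the starting point are illustrative
choices, and the update assumes f′′(x^k) ≠ 0.

```python
def newton_minimize_1d(f_prime, f_double_prime, x0, eps=1e-10, max_iter=100):
    """Newton's method for a single-variable problem: find x with f'(x) = 0."""
    x = x0
    for _ in range(max_iter):
        g = f_prime(x)
        if abs(g) < eps:                    # |f'(x^k)| < eps
            break
        x_new = x - g / f_double_prime(x)   # assumes f''(x^k) != 0
        if abs(x_new - x) < eps:            # |x^{k+1} - x^k| < eps
            return x_new
        x = x_new
    return x

# Illustration: f(x) = x^4 - 3x^2 + x, so f'(x) = 4x^3 - 6x + 1, f''(x) = 12x^2 - 6
print(newton_minimize_1d(lambda x: 4 * x**3 - 6 * x + 1,
                         lambda x: 12 * x**2 - 6, x0=2.0))
```

From x^0 = 2 this should converge to the local minimizer near x ≈ 1.13, not the
global minimizer near x ≈ −1.30, which illustrates the remark above that
f′(x) = 0 alone does not guarantee a global minimum.
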
Newton’s method for single-variate NLPs

Let f be twice differentiable. We want to find x satisfying f′(x) = 0.
For any x^k, let

    f_Q(x) = f(x^k) + f′(x^k)(x − x^k) + (1/2) f′′(x^k)(x − x^k)²

be the quadratic approximation of f at x^k.
We move from x^k to x^{k+1} by moving to the global minimum of this quadratic
approximation (which exists when f′′(x^k) > 0):

    x^{k+1} = arg min_{x ∈ R} [ f(x^k) + f′(x^k)(x − x^k) + (1/2) f′′(x^k)(x − x^k)² ]

Differentiating this objective with respect to x, we have

    f′(x^k) + f′′(x^k)(x^{k+1} − x^k) = 0 ⇔ x^{k+1} = x^k − f′(x^k)/f′′(x^k)

Newton’s Method for multi-variate NLPs

The unconstrained problem: find x∗ ∈ R^n such that f(x) attains its minimum,
where f(x) is three times continuously differentiable.
The Taylor series expansion of f about the current point x^k is

    f(x) ≈ f(x^k) + (x − x^k)ᵀ d^k + (1/2)(x − x^k)ᵀ H(x^k)(x − x^k) ≜ q(x),

where d^k = ∇f(x^k). We have q(x^k) = f(x^k), ∇q(x^k) = ∇f(x^k) = d^k, and
∇²q(x^k) = ∇²f(x^k) = H(x^k). Then, instead of minimizing f(x), we minimize q(x).
If H(x^k) > 0 (positive definite), then q achieves its minimum at

    x^{k+1} := x^k − H(x^k)⁻¹ d^k

Example. Minimize f(x) = x1⁴ + 2x1²x2² + x2⁴ with starting point x^0 = (1, 1);
a numerical sketch of this iteration is given below.

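A minimal sketch of the multivariate Newton iteration, applied to the example
above (added to these notes); solving the linear system H(x^k)p = ∇f(x^k)
instead of forming the inverse is an implementation choice, and the gradient
and Hessian are hard-coded for this particular f.

```python
import numpy as np

def newton_method(grad_f, hess_f, x0, tol=1e-8, max_iter=50):
    """Newton's method: x^{k+1} = x^k - H(x^k)^{-1} ∇f(x^k)."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        g = grad_f(x)
        if np.linalg.norm(g) < tol:
            break
        p = np.linalg.solve(hess_f(x), g)   # Newton step: H(x^k) p = ∇f(x^k)
        x = x - p
    return x

# Example: f(x) = x1^4 + 2 x1^2 x2^2 + x2^4, x^0 = (1, 1)
grad = lambda x: np.array([4 * x[0]**3 + 4 * x[0] * x[1]**2,
                           4 * x[0]**2 * x[1] + 4 * x[1]**3])
hess = lambda x: np.array([[12 * x[0]**2 + 4 * x[1]**2, 8 * x[0] * x[1]],
                           [8 * x[0] * x[1], 4 * x[0]**2 + 12 * x[1]**2]])
print(newton_method(grad, hess, [1.0, 1.0]))   # moves toward the minimizer (0, 0)
```
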
Newton’s Method

Remarks on Newton’s method:
Newton’s method does not have the step-size issue.
In many cases it is faster.
For a quadratic function, Newton’s method finds an optimal solution in one
iteration.
It may fail to converge for some functions.
More issues in general:
convergence guarantees,
convergence speed,
non-differentiable functions,
constrained optimization.
Example. Minimize f(x) = x1² + 2x2³ with starting point x^0 = (6, 6); see the
sketch below.

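As a usage example (reusing the newton_method sketch from the previous slide,
which is an addition of these notes, not lecture code), running Newton’s method
on this last example shows that the iterates tend to the stationary point
(0, 0), which is not a minimizer of f.

```python
import numpy as np

# Example: f(x) = x1^2 + 2 x2^3, starting from x^0 = (6, 6)
grad = lambda x: np.array([2 * x[0], 6 * x[1]**2])
hess = lambda x: np.array([[2.0, 0.0], [0.0, 12 * x[1]]])

print(newton_method(grad, hess, [6.0, 6.0]))   # tends to (0, 0)
# (0, 0) is a stationary point but not a minimizer: f(0, x2) = 2*x2^3 < 0 for x2 < 0.
```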
