
Chapter 10 QUASI-NEWTON METHODS

In this chapter we take another approach toward the development of methods lying
somewhere intermediate to steepest descent and Newton’s method. Again working
under the assumption that evaluation and use of the Hessian matrix is impractical
or costly, the idea underlying quasi-Newton methods is to use an approximation to
the inverse Hessian in place of the true inverse that is required in Newton’s method.
The form of the approximation varies among different methods—ranging from
the simplest where it remains fixed throughout the iterative process, to the more
advanced where improved approximations are built up on the basis of information
gathered during the descent process.
The quasi-Newton methods that build up an approximation to the inverse
Hessian are analytically the most sophisticated methods discussed in this book for
solving unconstrained problems and represent the culmination of the development
of algorithms through detailed analysis of the quadratic problem. As might be
expected, the convergence properties of these methods are somewhat more difficult
to discover than those of simpler methods. Nevertheless, we are able, by continuing
with the same basic techniques as before, to illuminate their most important features.
In the course of our analysis we develop two important generalizations of
the method of steepest descent and its corresponding convergence rate theorem.
The first, discussed in Section 10.1, modifies steepest descent by taking as the
direction vector a positive definite transformation of the negative gradient. The
second, discussed in Section 10.8, is a combination of steepest descent and Newton’s
method. Both of these fundamental methods have convergence properties analogous
to those of steepest descent.

10.1 MODIFIED NEWTON METHOD


A very basic iterative process for solving the problem

$$\text{minimize } f(x)$$

which includes as special cases most of our earlier ones is



$$x_{k+1} = x_k - \alpha_k S_k \nabla f(x_k)^T \qquad (1)$$

where $S_k$ is a symmetric $n \times n$ matrix and where, as usual, $\alpha_k$ is chosen to
minimize $f(x_{k+1})$. If $S_k$ is the inverse of the Hessian of $f$, we obtain Newton's
method, while if $S_k = I$ we have steepest descent. It would seem to be a good idea,
in general, to select $S_k$ as an approximation to the inverse of the Hessian. We examine
that philosophy in this section.
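
To make the role of $S_k$ concrete, the following sketch (Python with NumPy and SciPy; the function names and the bracket used for the one-dimensional search are illustrative choices, not part of the text) carries out a single step of (1), with $\alpha_k$ obtained by numerically minimizing $f$ along the deflected direction. Taking S equal to the identity recovers steepest descent, while taking it equal to the inverse Hessian recovers Newton's method.

import numpy as np
from scipy.optimize import minimize_scalar

def modified_newton_step(f, grad_f, x, S):
    # One step of (1): x_{k+1} = x_k - alpha_k S_k grad f(x_k),
    # with alpha_k chosen to (approximately) minimize f(x_{k+1}).
    g = grad_f(x)
    d = -S @ g                                   # deflected negative gradient
    # One-dimensional search over an illustrative bracket [0, 10].
    res = minimize_scalar(lambda a: f(x + a * d), bounds=(0.0, 10.0), method="bounded")
    return x + res.x * d
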
First, we note, as in Section 8.8, that in order that the process (1) be guaranteed
to be a descent method for small values of $\alpha$, it is necessary in general to require
that $S_k$ be positive definite. We shall therefore always impose this as a requirement.
Because of the similarity of the algorithm (1) with steepest descent† it should
not be surprising that its convergence properties are similar in character to our
earlier results. We derive the actual rate of convergence by considering, as usual,
the standard quadratic problem with

$$f(x) = \tfrac{1}{2} x^T Q x - b^T x \qquad (2)$$

where Q is symmetric and positive definite. For this case we can find an explicit
expression for $\alpha_k$ in (1). The algorithm becomes

$$x_{k+1} = x_k - \alpha_k S_k g_k \qquad (3a)$$

where

$$g_k = Q x_k - b \qquad (3b)$$

$$\alpha_k = \frac{g_k^T S_k g_k}{g_k^T S_k Q S_k g_k}. \qquad (3c)$$
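
Since $\alpha_k$ has the closed form (3c) in the quadratic case, the iteration (3) is easy to carry out directly. A minimal sketch in Python with NumPy (the function name and stopping tolerance are illustrative choices):

import numpy as np

def modified_newton_quadratic(Q, b, S, x0, iters=100, tol=1e-10):
    # Iteration (3) for f(x) = (1/2) x^T Q x - b^T x with a fixed S.
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(iters):
        g = Q @ x - b                                # (3b) gradient
        if np.linalg.norm(g) < tol:
            break
        alpha = (g @ S @ g) / (g @ S @ Q @ S @ g)    # (3c) exact line-search step
        x = x - alpha * (S @ g)                      # (3a) update
    return x
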

We may then derive the convergence rate of this algorithm by slightly extending
the analysis carried out for the method of steepest descent.

Modified Newton Method Theorem (Quadratic case). Let $x^*$ be the unique
minimum point of $f$, and define $E(x) = \tfrac{1}{2}(x - x^*)^T Q (x - x^*)$.
Then for the algorithm (3) there holds at every step $k$

$$E(x_{k+1}) \le \left( \frac{B_k - b_k}{B_k + b_k} \right)^2 E(x_k) \qquad (4)$$

where $b_k$ and $B_k$ are, respectively, the smallest and largest eigenvalues of the
matrix $S_k Q$.


† The algorithm (1) is sometimes referred to as the method of deflected gradients, since the
direction vector can be thought of as being determined by deflecting the gradient through
multiplication by $S_k$.

Proof. We have by direct substitution


$$\frac{E(x_k) - E(x_{k+1})}{E(x_k)} = \frac{(g_k^T S_k g_k)^2}{(g_k^T S_k Q S_k g_k)(g_k^T Q^{-1} g_k)}.$$

Letting $T_k = S_k^{1/2} Q S_k^{1/2}$ and $p_k = S_k^{1/2} g_k$ we obtain

$$\frac{E(x_k) - E(x_{k+1})}{E(x_k)} = \frac{(p_k^T p_k)^2}{(p_k^T T_k p_k)(p_k^T T_k^{-1} p_k)}.$$
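
For reference, recalling the inequality from the analysis of steepest descent: the Kantorovich inequality states that for a symmetric positive definite matrix $T$ with smallest and largest eigenvalues $b$ and $B$, and any nonzero vector $p$,

$$\frac{(p^T p)^2}{(p^T T p)(p^T T^{-1} p)} \ge \frac{4 b B}{(b + B)^2},$$

and $1 - 4bB/(b+B)^2 = \left( (B - b)/(B + b) \right)^2$.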

From the Kantorovich inequality we obtain easily


$$E(x_{k+1}) \le \left( \frac{B_k - b_k}{B_k + b_k} \right)^2 E(x_k),$$

where $b_k$ and $B_k$ are the smallest and largest eigenvalues of $T_k$. Since
$S_k^{1/2} T_k S_k^{-1/2} = S_k Q$, we see that $S_k Q$ is similar to $T_k$ and therefore has
the same eigenvalues.

This theorem supports the intuitive notion that for the quadratic problem one
should strive to make $S_k$ close to $Q^{-1}$, since then both $b_k$ and $B_k$ would be close
to unity and convergence would be rapid. For a nonquadratic objective function $f$
the analog to $Q$ is the Hessian $F(x)$, and hence one should try to make $S_k$ close to
$F(x_k)^{-1}$.
Two remarks may help to put the above result in proper perspective. The
first remark is that both the algorithm (1) and the theorem stated above are only
simple, minor, and natural extensions of the work presented in Chapter 8 on steepest
descent. As such the result of this section can be regarded, correspondingly, not as
a new idea but as an extension of the basic result on steepest descent. The second
remark is that this one simple result when properly applied can quickly characterize
the convergence properties of some fairly complex algorithms. Thus, rather than
an isolated result concerned with a specific form of algorithm, the theorem above
should be regarded as a general tool for convergence analysis. It provides significant
insight into various quasi-Newton methods discussed in this chapter.
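
The bound (4) is also easy to check numerically on a small example. The following sketch (Python with NumPy; the particular $Q$, $S$, and starting point are arbitrary illustrative choices, not taken from the text) performs one step of (3), computes $b_k$ and $B_k$ as the extreme eigenvalues of $S_k Q$, and verifies the inequality:

import numpy as np

rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n))
Q = A @ A.T + n * np.eye(n)                 # symmetric positive definite Q
b = rng.standard_normal(n)
S = np.linalg.inv(Q) + 0.1 * np.eye(n)      # a rough approximation to Q^{-1}

x_star = np.linalg.solve(Q, b)              # unique minimum point of f
E = lambda x: 0.5 * (x - x_star) @ Q @ (x - x_star)

x = rng.standard_normal(n)
g = Q @ x - b
alpha = (g @ S @ g) / (g @ S @ Q @ S @ g)   # (3c)
x_next = x - alpha * (S @ g)                # (3a)

eigs = np.linalg.eigvals(S @ Q).real        # S Q is similar to T_k, so its eigenvalues are real
bk, Bk = eigs.min(), eigs.max()
bound = ((Bk - bk) / (Bk + bk)) ** 2
assert E(x_next) <= bound * E(x) + 1e-12    # inequality (4)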

A Classical Method
We conclude this section by mentioning the classical modified Newton's method, a
standard method for approximating Newton's method without evaluating $F(x_k)^{-1}$
for each $k$. We set

$$x_{k+1} = x_k - \alpha_k [F(x_0)]^{-1} \nabla f(x_k)^T. \qquad (5)$$

In this method the Hessian at the initial point $x_0$ is used throughout the process.
The effectiveness of this procedure is governed largely by how fast the Hessian is
changing—in other words, by the magnitude of the third derivatives of $f$.
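
A minimal sketch of (5) in Python with NumPy (the callables f, grad_f, and hess_f and the backtracking parameters are illustrative assumptions; a backtracking search stands in for the exact choice of $\alpha_k$, and $F(x_0)$ is assumed positive definite so that the direction is a descent direction):

import numpy as np

def classical_modified_newton(f, grad_f, hess_f, x0, iters=20):
    # Iteration (5): x_{k+1} = x_k - alpha_k F(x0)^{-1} grad f(x_k).
    F0_inv = np.linalg.inv(hess_f(x0))       # inverse Hessian at x0, computed once
    x = np.asarray(x0, dtype=float).copy()
    for _ in range(iters):
        g = grad_f(x)
        if np.linalg.norm(g) < 1e-10:
            break
        d = -F0_inv @ g                      # search direction
        alpha = 1.0                          # backtracking stand-in for the exact search
        while f(x + alpha * d) > f(x) + 1e-4 * alpha * (g @ d):
            alpha *= 0.5
        x = x + alpha * d
    return x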
