0% found this document useful (0 votes)

122 views14 pages

Trust Region Methods

TRM for unconstrained opt

Uploaded by

Benabida

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

122 views14 pages

Trust Region Methods

TRM for unconstrained opt

Uploaded by

Benabida

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

University Mohamed El-Bachir El-Ibrahimi of

Bordj Bou Arreridj

Faculty of mathematic and computer science
Department of operations research

Non linear optimization mini-project

Trust region methods for unconstrained

minimization problems

Authored by:
Benabida Sif Eddine

Mai 2024
Contents

List of Figures 1

1 Introduction 2

2 The basic trust region algorithm 3

3 Solutions of the subproblem 5

3.1 Trust region Newton’s method: . . . . . . . . . . . . . . . . . . . . 6
3.1.1 The easy case . . . . . . . . . . . . . . . . . . . . . . . . . . 6
3.1.2 The hard case . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.2 The dogleg method . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

4 Global convergence 9
4.1 Sufficient reduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
4.1.1 The Cauchy point . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 Convergence to stationary points . . . . . . . . . . . . . . . . . . . 10
4.2.1 General case . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2.2 Algorithm based on Newton’s method . . . . . . . . . . . . . 11

5 Numerical example 12

6 Conclusions 12

References 13

List of Figures
1 Trust region step . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
2 ’∥p(λ)∥ as a function of λ’ . . . . . . . . . . . . . . . . . . . . . . . 7
3 The dogleg method . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1 Introduction
Trust region methods,or sometimes called restricted step methods, are methods
used to solve unconstrained minimization problems .To read about Trust region
methods for constrained optimization I refer to [2] and [1] . The motivation for
the creation of these methods is to deal with the case of the non-positive definite
hessian matrices in the Newton’s method. I refer to Fletcher book [3] to read about
Newton’s method.

The mathematical formulation of the unconstrained minimization problem is:

min f (x)
x∈Rn

where f: Rn → R is a smooth function (a function that has continuous derivatives

up to some desired order over some domain).

The strategy of trust region methods is to construct a model mk that has similar
behavior to the objective function f over some domain near the current iterate
xk using the information gathered about this objective function. This model is
obtained by truncating the Taylor series for f (xk + p) which is:
1
f (xk + p) = f (xk ) + ∇f (xk )T p + pT ∇2 f (xk + tp)p
2
for p ∈ Rn and some t ∈ (0, 1).

In other words, Trust region methods define a region around the current iter-
ate within which they trust the model to be an adequate representation of the
objective function, and then choose the step to be the approximate minimizer of
the model in this region, and if the reduction in f predicted by the model function
is not acceptable according to some criteria that we are going to talk about later,
they reduce the size of this region and find a new minimizer.

On the other hand, if the minimization of the model inside the trust region
is producing good steps and predicting the behavior of the objective function, we
could increase the size of the trust region to allow more efficient steps to be taken
(i.e. that predict the behavior the objective function more efficiently).

Briefly, if the region is too small, it might unnecessarily limit the algorithm’s
ability to explore and find the optimal solution efficiently, and from another point
of view, if it’s too large, the algorithm might take steps that lead to invalid or
undesirable regions.

2
2 The basic trust region algorithm
Before getting into the outlines of the main algorithm, let’s talk about uncon-
strained optimization algorithms.

Algorithms for this type of problems require the user to supply a starting point
denoted x0 ,and starting from it , the algorithm generate a sequence of feasible
solutions x1 , x2 , x3 , ... called iterates . The algorithms stop when there is no more
progress to be made (i.e.f (xk ) ≤ f (x) for all x), or when we get an accurate
approximation to the solution of the problem.

The algorithm purpose is to find a new iterate xk+1 that has a lower function
value than xk .There are non-monotone algorithms that doesn’t insist on that, but
there will be always a decrease in f after some number of iterations m, that is:
f (xk+m ) ≤ f (xk ).

Now let’s define the model mk , as we said earlier , it is based on the Taylor
series for f (xk + p) , by replacing the hessian of the function f by an approximation
Hk to it , we get the model definition :
1
mk (p) = f (xk ) + ∇f (xk )T p + pT Hk p
2
Where Hk is a symmetric matrix.

When ∇2 f (xk ) is available and positive definite, we take Hk to be the exact

hessian of the objective function, we call our approach a Trust region Newton
method.

To find the step that should be taken from the iterate xk we solve this subproblem:

min mk (pk ) s.t ∥pk ∥ ≤ ∆k (1)

pk ∈Rn

Where ∆ is the trust region radius and ∥.∥ is the Euclidean norm. So p∗ is the
minimizer of mk in the ball of radius ∆ ( see figure 1).

To calculate the reduction in f predicted by the model in the iteration k we define

3
Figure 1: Trust region step

the ratio σk by:

f (xk ) − f (xk + pk )
σk = (2)
mk (0) − mk (pk )
Where pk is the step taken at the iteration k with ∥pk ∥ ≤ ∆k and ∆k is the radius
of the Trust region at this iteration.

In σk , note that f (xk ) − f (xk + pk ) is the actual reduction in f and mk (0) − mk (pk )
is the predicted reduction , so σk is simply the reduction in f predicted by the
model function .This is the criteria that we talked about earlier to measure the
accuracy to which the reduction in mk predicts the reduction in f , in the sense
that the closer σk is to unity, the better is the agreement.

We distinguish here three cases:

1. When σk is close to 1, there is a good agreement between f and the model

function over this step (i.e. we could expand the size of the trust region in
the next iteration.)

2. When σk is positive but significantly smaller than 1 , we do not change the

size of the trust region.

3. When σk is close to zero or negative, then there is no agreement between the

objective function and the model over this step, so we shrink the size of the
trust region in the next iteration.

4
Notice here that we’re talking about the size of the trust region, not whether
the step taken is acceptable or no. Actually the step is acceptable whenever
f (xk ) > f (xk + pk ) (i.e. whenever σk > 0 ).

So let’s translate what we’ve just said into the general algorithm of the trust
region method (how does the algorithm works at the iteration k):

Algorithm 1 (Trust region)

(i) Given xk , ∆∞ and ∆k ∈ (0, ∆∞ ).

(ii) Solve (1) to obtain pk ;

(iii) Evaluate σk from ??;

(iv) If σk < 1/4 then ∆k+1 = 14 ∆k ;

(v) Else if σk > 3

4
and ∥pk ∥ = ∆k then ∆k+1 = min(2∆k , ∆∞ );

(vi) Else ∆k+1 = ∆k ;

(vii) If σk > 0 then xk+1 = xk + pk ;

(viii) Else xk+1 = xk

Where ∆∞ is the the overall bound of ∥pk ∥ .

3 Solutions of the subproblem

As you can see in the algorithm 1, finding the approximate solution of the
subproblem is critical in order to find the step that reduce the model value inside
the trust region.

We allow ourselves to drop the subscript k from the subproblem and rewrite
it as follow:
minn m(p) s.t ∥p∥ ≤ ∆ (3)
p∈R

There are different approaches to solve that, but we’re going to talk about two
of them: the trust region Newton method and the dogleg method(to see other
approaches I refer to [6] p33-52) , but first, let’s talk about the theorem (due to

5
Moré and Sorensen) that define the necessary and sufficient optimality conditions
for the subproblem (3):

Theorem 1 (Moré and Sorensen)

We say that p∗ is a global solution of the subproblem (3) if and only if p∗ is

feasible and there exists a scalar λ ≥ 0 such that :

1. (H + λI)p∗ = −∇f

2. λ(∆ − ∥p∗ ∥) = 0

3. H + λI is positive semidefinite.

3.1 Trust region Newton’s method:

To find the solution p∗ of the subproblem (3), we’re going to base our approach
on the theorem mentioned above.
We can see that:
• Either ∥p∗ ∥ ≤ ∆ then λ = 0 and so : p∗ = −H −1 ∇f with H positive
semidefinite. In this case we call p∗ the full step.

• Or else we define p(λ) = −(H +λI)−1 ∇f where H +λI is positive semidefinite

and ∥p∗ ∥ = ∆, so we’re going to search for a scalar λ ≥ 0 that satisfy the
features just described .
To do that we’re going to solve the one dimensional root finding problem in the
variable λ , that is :
∥p(λ)∥ = ∆ (4)
Since H is symmetric , we can apply a spectral decomposition on it and write :

H = QΛQT

Such that : Λ = diag(λ1 , λ2 , . . . , λn ) where λ1 ≤ λ2 ≤ ... ≤ λn , are the eigenvalues

of H and Q = [q1 |q2 |. . . |qn ] where qi is the ith eigenvector of H.
Here we are going to consider two cases based on the value of q1T ∇f :

3.1.1 The easy case

When q1T ∇f ̸= 0, it can be shown that there is a unique λ∗ ∈ (−λ1 , +∞) that
satisfy ∥p(λ)∥ = ∆ and the conditions of the theorem 1, (see figure 3).
for λ ̸= λi we have :

6
Figure 2: ’∥p(λ)∥ as a function of λ’

n
qiT ∇f
p(λ) = −Q(Λ + λI)QT ∇f = −
X
qi
i=1 λi + λ
and we find:
n
(qiT ∇f )2
∥p(λ)∥2 =
X

i=1 (λi + λ)
2

In this case we can solve the problem (4) using the root finding Newton’s method
(I refer to [5] p633 to read more about this method ), and the algorithm goes as
follows:

Algorithm 2 (Trust region subproblem)

(i) Given λk and ∆ > 0

(ii) Factor H + λk I = LT L;

(iii) Solve LT Lpk = −∇f ; LT qk = pk ;

(iv) Set λk+1 = λk + ( ∥p k ∥ 2 ∥pk ∥−∆

∥qk ∥
) ( ∆ );

Note that the factorization used in this algorithm is the Cholesky factorization
(I refer to [5] p608 to read about it ). Some safeguards must be added to the
algorithm to make it practical , for example λk > −λ1 because when λk ≤ −λ1 the
Cholesky factorization will not exist.

7
3.1.2 The hard case
When q1T ∇f = 0 , the situation becomes a little complicated because there may
not be a value λ∗ ∈ (−λ1 , +∞) such that ∥p(λ)∥ = ∆ , but from theorem 1, we
can show that λ∗ ∈ [−λ1 , +∞) , so it remains one possibility, that is: λ∗ = −λ1 .
In this case we set :
n
qiT ∇f
p(λ) = − qi + αz
X

i=2 λi + λ
Where z is the eigenvector of H that corresponds to the eigenvalue λ1 , α is a scalar
and : n
(qiT ∇f )2
∥p(λ)∥2 = + α2
X

i=2 (λ i + λ) 2

And we choose the value of α to ensure that : ∥p(λ)∥ = ∆.

3.2 The dogleg method

The second method that we’re going to talk about can be used when H is
positive definite.
When H is positive definite, we distinguish two cases from theorem 1 :

1. ∥pF ∥ = ∥ − H −1 ∇f ∥ ≤ ∆ and in this case we obviously take : p∗ = pF , such

that pF is the full step.

2. ∆ < ∥pF ∥ (i.e. the full step is not allowed by the trust region). In this case
we have to use the dogleg method that we are going to talk about.

We start by creating a path consisting of two line segments. The first line segment
runs from the origin (the current iterate) to the minimizer of m along the steepest
descent direction −∇f , called the Cauchy point, which is :

∇T f ∇f
pu = − ∇f
∇T f H∇f
and the second line goes from pu to pF (see figure 3).
We formally denote this trajectory by pd (µ) where :

µp
u 0≤µ<1
pd (µ) = 
pu + (µ − 1)(pF − pu ) 1 ≤ µ ≤ 2
There is a lemma that ensures that the trust region boundary ∥p∥ = ∆ intersect
with this path at exactly one point when the full step is not allowed by the trust
region, and nowhere else. The chosen value of p will be at the point of intersection

8
Figure 3: The dogleg method

of the dogleg and the trust region boundary.

To find this point, we solve the following scalar quadratic equation:

∥pu + (1 − µ)(pF − pu )∥2 = ∆2

When ∥pu ∥ ≥ ∆ (i.e. the Cauchy step is not inside the trust region) the solution is
given by:
∆
p∗ = pu
∥pu ∥
Note that when H is an indefinite matrix, we can’t use the dogleg strategy because
pF is not the unconstrained minimizer of m in this case.

4 Global convergence
To yield global convergence, the method must iterates towards a critical point
and at each iteration, pk must gives a sufficient reduction in the model.

4.1 Sufficient reduction

The reduction gotten by pk in the model mk is called the predicted reduction,
that is: mk (0) − mk (pk ). Actually, we say that the predicted reduction is sufficient

9
if :
∥∇f (xk )∥
!
mk (0) − mk (pk ) ≥ c1 ∥∇f (xk )∥ min ∆k , (5)
∥Hk ∥
For some c ∈ (0, 1].

4.1.1 The Cauchy point

The sufficient reduction can be quantified in terms of the Cauchy point defined
by:
∆k
pu = −τk ∇f (xk )
∥∇f (xk )∥
where:

1 ∇f (xk )T Hk ∇f (xk ) ≤ 0
τk =  3

min 1, ∆k ∇f∥∇f (xk )∥
(xk )T Hk ∇f (xk )
otherwise
Actually, it can be shown that the Cauchy point satisfies (5) with c = 1
2
(you can
check the proof of this in [5]). We conclude from this property that if:

mk (0) − mk (pk ) ≥ c2 [mk (0) − mk (pu )]

then pk satisfies (5) with c1 = c22 .
From that we can say that the reduction achieved by our approximate solution pk
is sufficient if it is at least some fixed fraction c2 of the reduction achieved by the
Cauchy point.

4.2 Convergence to stationary points

In this part we are going to consider the convergence of the general algorithm
and then when the subproblem is solved by the trust region Newton method.

4.2.1 General case

To ensure that the algorithm 1 converges to stationary point we are going to
consider some conditions that must be satisfied in order to achieve that.
We define the level set S by:

S = {x | f (x) ≤ f (x0 )}
and an open neighborhood of this set by:

S(r) = {x | ∥x − y∥ < r for some y ∈ S}

10
Where r is a positive constant.

Theorem 2

Algorithm 1 converge to stationary points, which means:

lim inf ∥∇f (xk )∥ = 0

k→∞

If these conditions hold:

1. ∥Hk ∥ ≤ β for some constant β

2. f is bounded below on the level set S and Lipschtz continuously differen-

tiable in the neighborhood S(r).

3. pk satisfies (5).

4. ∥pk ∥ ≤ γ∆k for some constant γ ≥ 1

I refer to the proof of this theorem in [5] p80-82.

4.2.2 Algorithm based on Newton’s method

When we use the Newton’s method to solve the subproblem, there are some
additional conditions that must be satisfied in order to achieve convergence to
critical points.
The following theorem describe how this could be achieved:

Theorem 3

Suppose that the assumptions of theorem 2 are satisfied and in addition:

1. f is twice continuously differentiable in the level set S (i.e. f ∈ C 2 (S).)

2. Hk = ∇2 f (xk )

Then:

lim ∥∇f (xk )∥ = 0

k→∞

I omit the proof, which can be found in Moré and Sorensen [4] section 4.

11
5 Numerical example
In this section, we’re going to apply the Trust region method and the dogleg
strategy to find the minimum of the Rosenbrock function defined by:

f (x, y) = (a − x)2 + b(y − x2 )2

where a and b are constants.
For a = 1 and b = 100 we have :

f (x, y) = (1 − x)2 + 100(y − x2 )2

then:

∇f (x, y) = (−2(1 − x) − 400x(y − x2 ), 200(y − x2 ))T

and:

2 − 400(y − 3x2 ) −400x

" #
∇ f (x, y) =
2
−400x 200
By implementing the algorithm for x0 = (2, 2) we find the optimal solution:
x∗ = (1, 1) with f ∗ = 0. In finance, the Rosenbrock function could be used
in portfolio optimization, where the goal is to find the allocation of assets that
minimizes the risk while maximizing the return on investment. The Rosenbrock
function can represent the objective function that captures the trade-off between
risk and return, and optimization techniques can be employed to find the optimal
portfolio allocation.

6 Conclusions
The main purpose of this work has been to discuss the main trust region
algorithm used to solve unconstrained minimization problems, with an aim towards
understanding the best way to implement it in order to achieve the convergence
towards the global solution. We introduced theoretical aspects of the method, gived
the necessary and sufficient conditions in order to achieve global convergence, and
finally implemented the algorithm on the Rosenbrock function.

12
References
[1] Richard H Byrd, Robert B Schnabel, and Gerald A Shultz. A trust region
algorithm for nonlinearly constrained optimization. SIAM Journal on Numerical
Analysis, 24(5):1152–1170, 1987.

[2] Andrew R Conn, Nicholas IM Gould, and Philippe L Toint. Trust region
methods. SIAM, 2000.

[3] Roger Fletcher. Practical methods of optimization. John Wiley & Sons, 2000.

[4] Jorge J Moré and Danny C Sorensen. Computing a trust region step. SIAM
Journal on scientific and statistical computing, 4(3):553–572, 1983.

[5] Jorge Nocedal and Stephen J Wright. Numerical optimization. Springer, 1999.

[6] Mostafa Rezapour. Trust-Region Methods for Unconstrained Optimization

Problems. Washington State University, 2020.

Trust Region Methods (Book) - Conn, Gould, Toint
No ratings yet
Trust Region Methods (Book) - Conn, Gould, Toint
980 pages
Real Analysis Project
50% (2)
Real Analysis Project
14 pages
Daniel Voigt Godoy - Deep Learning With PyTorch Step-By-Step A Beginner's Guide-Leanpub - Com (2022)
100% (1)
Daniel Voigt Godoy - Deep Learning With PyTorch Step-By-Step A Beginner's Guide-Leanpub - Com (2022)
1,045 pages
Levenberg Marquardt Algorithm
100% (5)
Levenberg Marquardt Algorithm
5 pages
Unconstrained Numerical Optimization An Introduction For Econometricians
100% (1)
Unconstrained Numerical Optimization An Introduction For Econometricians
32 pages
Trust Region Subproblem
No ratings yet
Trust Region Subproblem
80 pages
Lecture 05 - Unconstrained
No ratings yet
Lecture 05 - Unconstrained
21 pages
Trust Region
No ratings yet
Trust Region
23 pages
Unconstrained and Constrained Optimization Algorithms by Soman K.P
No ratings yet
Unconstrained and Constrained Optimization Algorithms by Soman K.P
166 pages
Lecture
No ratings yet
Lecture
39 pages
Numerische Mathematik: Efficient Algorithms For Solving The - Laplacian in Polynomial Time
No ratings yet
Numerische Mathematik: Efficient Algorithms For Solving The - Laplacian in Polynomial Time
32 pages
2023 5 Data-Driven Parameter Estimation Nonlinear
No ratings yet
2023 5 Data-Driven Parameter Estimation Nonlinear
20 pages
Line Search and Trust-Region
0% (1)
Line Search and Trust-Region
4 pages
Chapter 3 Unconstrained Convex Optimization
No ratings yet
Chapter 3 Unconstrained Convex Optimization
28 pages
Jiyue Zeng Honors Thesis
No ratings yet
Jiyue Zeng Honors Thesis
59 pages
04 Nonlinear Systems and Optimization
No ratings yet
04 Nonlinear Systems and Optimization
74 pages
Advances in Operations Research - 2009 - Yuan - A Trust Region Based BFGS Method With Line Search Technique For Symmetric
No ratings yet
Advances in Operations Research - 2009 - Yuan - A Trust Region Based BFGS Method With Line Search Technique For Symmetric
22 pages
Lecture 9
No ratings yet
Lecture 9
8 pages
A Trust Region Filter Method For General Non-Linear Programming
No ratings yet
A Trust Region Filter Method For General Non-Linear Programming
18 pages
ODS Solution 1 WS1415
No ratings yet
ODS Solution 1 WS1415
13 pages
Chapter 6vh
No ratings yet
Chapter 6vh
12 pages
ConvergeofHybridSpaceMapping Alg
No ratings yet
ConvergeofHybridSpaceMapping Alg
12 pages
Introduction To Optimization - Jean-François Aujol
No ratings yet
Introduction To Optimization - Jean-François Aujol
51 pages
Newton-Raphson en INGLES
No ratings yet
Newton-Raphson en INGLES
24 pages
14 Newton
No ratings yet
14 Newton
24 pages
Dogleg Method With Broyden Class Updating Technique For Solving Trust-Region Sub-Problems of Some Unconstrained Multivariate Nonlinear Optimization Problems
No ratings yet
Dogleg Method With Broyden Class Updating Technique For Solving Trust-Region Sub-Problems of Some Unconstrained Multivariate Nonlinear Optimization Problems
12 pages
Trust Region Newton Method For Large-Scale Logistic Regression
No ratings yet
Trust Region Newton Method For Large-Scale Logistic Regression
25 pages
Global Convergence of Trust-Region Algorithms For Convex Constrained Minimization Without Derivatives
No ratings yet
Global Convergence of Trust-Region Algorithms For Convex Constrained Minimization Without Derivatives
7 pages
Seq Slides
No ratings yet
Seq Slides
43 pages
Optimumengineeringdesign Day3a
No ratings yet
Optimumengineeringdesign Day3a
34 pages
D. C. Sorensen: SIAM Journal On Numerical Analysis, Vol. 19, No. 2. (Apr., 1982), Pp. 409-426
No ratings yet
D. C. Sorensen: SIAM Journal On Numerical Analysis, Vol. 19, No. 2. (Apr., 1982), Pp. 409-426
20 pages
A Model-Trust Region Algorithm Utilizing A Quadratic Interpolant
No ratings yet
A Model-Trust Region Algorithm Utilizing A Quadratic Interpolant
11 pages
Global Convergence of A Trust-Region Sqp-Filter Algorithm For General Nonlinear Programming
No ratings yet
Global Convergence of A Trust-Region Sqp-Filter Algorithm For General Nonlinear Programming
25 pages
Chương 9
No ratings yet
Chương 9
12 pages
Opt Lec 10
No ratings yet
Opt Lec 10
16 pages
CS-6777 Liu Abs
No ratings yet
CS-6777 Liu Abs
103 pages
Unconstrained Optimization - Ipynb - Colaboratory
No ratings yet
Unconstrained Optimization - Ipynb - Colaboratory
5 pages
Structural and Multidisciplinary Optimization
No ratings yet
Structural and Multidisciplinary Optimization
33 pages
A new trust region method for unconstrained optimization: 夡 Zhen-Jun Shi, Jin-Hua Guo
No ratings yet
A new trust region method for unconstrained optimization: 夡 Zhen-Jun Shi, Jin-Hua Guo
12 pages
DONE AEM CH 2 Numerical Methods
No ratings yet
DONE AEM CH 2 Numerical Methods
22 pages
Note Set 7 - Nonlinear Equations: 7.1 - Overview
No ratings yet
Note Set 7 - Nonlinear Equations: 7.1 - Overview
10 pages
Lecture 5
No ratings yet
Lecture 5
6 pages
Nocedal - Wright CH - 02-02
No ratings yet
Nocedal - Wright CH - 02-02
12 pages
Deep Learning Unit 1
No ratings yet
Deep Learning Unit 1
32 pages
Unit VI Optimization Techniques Question Bank Solved Answer
No ratings yet
Unit VI Optimization Techniques Question Bank Solved Answer
20 pages
Levenberg-Marquardt Algorithms Vs Trust Region Algorithms: Frank Vanden Berghen
No ratings yet
Levenberg-Marquardt Algorithms Vs Trust Region Algorithms: Frank Vanden Berghen
4 pages
Irjet V2i411
No ratings yet
Irjet V2i411
6 pages
Trust Region Reflective
No ratings yet
Trust Region Reflective
2 pages
L8 Single Variable Optimization Algorithms
No ratings yet
L8 Single Variable Optimization Algorithms
9 pages
Process Optimization
No ratings yet
Process Optimization
70 pages
Lecture12
No ratings yet
Lecture12
6 pages
Chapter 8 Lecture Notes
No ratings yet
Chapter 8 Lecture Notes
4 pages
Chapter 9 Lecture Notes
No ratings yet
Chapter 9 Lecture Notes
3 pages
"Newton's Method and Loops": University of Karbala College of Engineering Petroleum Eng. Dep
No ratings yet
"Newton's Method and Loops": University of Karbala College of Engineering Petroleum Eng. Dep
11 pages
The Levenberg-Marquardt Algorithm: Ananth Ranganathan 8th June 2004
No ratings yet
The Levenberg-Marquardt Algorithm: Ananth Ranganathan 8th June 2004
5 pages
212C Numerical Methods
77% (30)
212C Numerical Methods
22 pages
Algorithms Process Optimization
No ratings yet
Algorithms Process Optimization
5 pages
Optimization Class Notes MTH-9842
No ratings yet
Optimization Class Notes MTH-9842
25 pages
Optim
No ratings yet
Optim
70 pages
5 2 Ot NM 18122020
No ratings yet
5 2 Ot NM 18122020
15 pages
Root-Finding Methods: Newton-Raphson and Secant Methods: June 1, 2025
No ratings yet
Root-Finding Methods: Newton-Raphson and Secant Methods: June 1, 2025
7 pages
Unit 1
No ratings yet
Unit 1
73 pages
MCS 208 Tte Dec Complete
No ratings yet
MCS 208 Tte Dec Complete
25 pages
Becoming AI Engineer Learning Path
No ratings yet
Becoming AI Engineer Learning Path
4 pages
Geo AI
No ratings yet
Geo AI
50 pages
Big Coding Y11
No ratings yet
Big Coding Y11
250 pages
Co 3 Sem 17330 Data Structure Using Cwinter 2018
No ratings yet
Co 3 Sem 17330 Data Structure Using Cwinter 2018
3 pages
Unit 3 - Machine Learning - WWW - Rgpvnotes.in
No ratings yet
Unit 3 - Machine Learning - WWW - Rgpvnotes.in
18 pages
QPM 7 Question Bank
No ratings yet
QPM 7 Question Bank
8 pages
Ug958 Vivado Sysgen Ref
No ratings yet
Ug958 Vivado Sysgen Ref
460 pages
I Ntegrat
No ratings yet
I Ntegrat
18 pages
Ece5520 CH06
No ratings yet
Ece5520 CH06
30 pages
MFL Characterization
No ratings yet
MFL Characterization
6 pages
Chapter 3
No ratings yet
Chapter 3
81 pages
Statistical Classification
No ratings yet
Statistical Classification
6 pages
Lecture 1 Introduction To Algorithms
No ratings yet
Lecture 1 Introduction To Algorithms
28 pages
Test Information and Coding Theory
No ratings yet
Test Information and Coding Theory
3 pages
Engineering Mathematics E2 Exam For May 2019 - Tagged
No ratings yet
Engineering Mathematics E2 Exam For May 2019 - Tagged
6 pages
Brain Stroke
No ratings yet
Brain Stroke
3 pages
Message Authentication Code
No ratings yet
Message Authentication Code
4 pages
20EC3305 - PTRP - Assignment 2 Questions - 2022-23
No ratings yet
20EC3305 - PTRP - Assignment 2 Questions - 2022-23
2 pages
Adaptive Vector Median Filtering
No ratings yet
Adaptive Vector Median Filtering
11 pages
Build Deep Learning NN Models
No ratings yet
Build Deep Learning NN Models
6 pages
Management Science Lecture Day 10
No ratings yet
Management Science Lecture Day 10
108 pages
Image Fusion Using MatlabIMAGE FUSION USING MATLAB
No ratings yet
Image Fusion Using MatlabIMAGE FUSION USING MATLAB
8 pages
Wavelet Convnets For Texture Classification
No ratings yet
Wavelet Convnets For Texture Classification
9 pages
PR3-12217033-Muhammad Aqsal Ilham
No ratings yet
PR3-12217033-Muhammad Aqsal Ilham
6 pages
Compression For Prefix-Free Codes.: // Make A Lookup Table From Trie
No ratings yet
Compression For Prefix-Free Codes.: // Make A Lookup Table From Trie
2 pages
Advanced college algebra study guide
From Everand
Advanced college algebra study guide
Harrison Cook
No ratings yet
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet

Trust Region Methods

Uploaded by

Trust Region Methods

Uploaded by

University Mohamed El-Bachir El-Ibrahimi of

Bordj Bou Arreridj

Non linear optimization mini-project

Trust region methods for unconstrained

2 The basic trust region algorithm 3

3 Solutions of the subproblem 5

The mathematical formulation of the unconstrained minimization problem is:

where f: Rn → R is a smooth function (a function that has continuous derivatives

When ∇2 f (xk ) is available and positive definite, we take Hk to be the exact

min mk (pk ) s.t ∥pk ∥ ≤ ∆k (1)

To calculate the reduction in f predicted by the model in the iteration k we define

the ratio σk by:

We distinguish here three cases:

1. When σk is close to 1, there is a good agreement between f and the model

2. When σk is positive but significantly smaller than 1 , we do not change the

3. When σk is close to zero or negative, then there is no agreement between the

Algorithm 1 (Trust region)

(i) Given xk , ∆∞ and ∆k ∈ (0, ∆∞ ).

(ii) Solve (1) to obtain pk ;

(iii) Evaluate σk from ??;

(iv) If σk < 1/4 then ∆k+1 = 14 ∆k ;

(v) Else if σk > 3

(vi) Else ∆k+1 = ∆k ;

(vii) If σk > 0 then xk+1 = xk + pk ;

(viii) Else xk+1 = xk

Where ∆∞ is the the overall bound of ∥pk ∥ .

3 Solutions of the subproblem

Theorem 1 (Moré and Sorensen)

We say that p∗ is a global solution of the subproblem (3) if and only if p∗ is

3.1 Trust region Newton’s method:

• Or else we define p(λ) = −(H +λI)−1 ∇f where H +λI is positive semidefinite

Such that : Λ = diag(λ1 , λ2 , . . . , λn ) where λ1 ≤ λ2 ≤ ... ≤ λn , are the eigenvalues

3.1.1 The easy case

Algorithm 2 (Trust region subproblem)

(i) Given λk and ∆ > 0

(iii) Solve LT Lpk = −∇f ; LT qk = pk ;

(iv) Set λk+1 = λk + ( ∥p k ∥ 2 ∥pk ∥−∆

And we choose the value of α to ensure that : ∥p(λ)∥ = ∆.

3.2 The dogleg method

1. ∥pF ∥ = ∥ − H −1 ∇f ∥ ≤ ∆ and in this case we obviously take : p∗ = pF , such

of the dogleg and the trust region boundary.

∥pu + (1 − µ)(pF − pu )∥2 = ∆2

4.1 Sufficient reduction

4.1.1 The Cauchy point

mk (0) − mk (pk ) ≥ c2 [mk (0) − mk (pu )]

4.2 Convergence to stationary points

4.2.1 General case

S(r) = {x | ∥x − y∥ < r for some y ∈ S}

Algorithm 1 converge to stationary points, which means:

lim inf ∥∇f (xk )∥ = 0

If these conditions hold:

1. ∥Hk ∥ ≤ β for some constant β

2. f is bounded below on the level set S and Lipschtz continuously differen-

4. ∥pk ∥ ≤ γ∆k for some constant γ ≥ 1

I refer to the proof of this theorem in [5] p80-82.

4.2.2 Algorithm based on Newton’s method

Suppose that the assumptions of theorem 2 are satisfied and in addition:

1. f is twice continuously differentiable in the level set S (i.e. f ∈ C 2 (S).)

lim ∥∇f (xk )∥ = 0

f (x, y) = (a − x)2 + b(y − x2 )2

f (x, y) = (1 − x)2 + 100(y − x2 )2

∇f (x, y) = (−2(1 − x) − 400x(y − x2 ), 200(y − x2 ))T

2 − 400(y − 3x2 ) −400x

[6] Mostafa Rezapour. Trust-Region Methods for Unconstrained Optimization

You might also like