
ECE 5314: Power System Operation & Control

Lecture 12: Gradient and Newton Methods

Vassilis Kekatos

R3 S. Boyd and L. Vandenberghe, Convex Optimization, Chapter 9.

R2 A. Gomez-Exposito, A. J. Conejo, C. Canizares, Electric Energy Systems: Analysis and Operation, Appendix B.

R1 A. J. Wood, B. F. Wollenberg, and G. B. Sheble, Power Generation, Operation, and Control, Wiley, 2014, Chapter 13.

Unconstrained minimization

Assume f is convex, twice continuously differentiable, and that p∗ is finite

p∗ := min_x f(x)

unconstrained minimization methods

• produce a sequence of points x^t with f(x^t) → p∗

• interpreted as iterative methods for solving the optimality condition

∇f(x∗) = 0

• if ∇²f(x) ⪰ mI with m > 0 (strong convexity), then

0 ≤ f(x) − p∗ ≤ ‖∇f(x)‖₂² / (2m)

useful as a stopping criterion (assuming m is known)
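
As a quick numerical illustration of this bound (a minimal sketch; the quadratic below and its strong-convexity constant are made up, not from the lecture):

    import numpy as np

    # f(x) = x^T A x has Hessian 2A, so m = 2*lambda_min(A); p* = 0 is attained at x = 0
    A = np.diag([1.0, 5.0])                       # made-up positive definite A
    m = 2 * np.linalg.eigvalsh(A).min()
    x = np.array([3.0, -2.0])                     # an arbitrary query point
    gap = x @ A @ x - 0.0                         # f(x) - p*
    grad = 2 * A @ x
    print(gap <= (grad @ grad) / (2 * m))         # True: ||grad f(x)||_2^2 / (2m) upper-bounds the gap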

Examples

Example 1: unconstrained QP (P = P^⊤ ≻ 0); a numerical check follows Example 3:

min_x x^⊤ P x + 2 q^⊤ x + r

Example 2: analytic center of linear inequalities

min_x − ∑_{i=1}^{m} log(b_i − a_i^⊤ x)

Example 3: interior-point methods tackle constrained problems by solving a sequence of unconstrained minimization problems
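
For Example 1, the minimizer is available in closed form: setting the gradient 2Px + 2q to zero gives x∗ = −P^{−1}q. A minimal numerical check (the data below are made up):

    import numpy as np

    P = np.array([[2.0, 0.5],
                  [0.5, 1.0]])                    # P = P^T, positive definite (made-up data)
    q = np.array([1.0, -1.0])
    x_star = np.linalg.solve(P, -q)               # x* = -P^{-1} q minimizes x^T P x + 2 q^T x + r
    print(2 * P @ x_star + 2 * q)                 # gradient at x*: numerically zero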

Descent method

1. Compute search direction ∆x^t

2. Choose step size µ_t > 0

3. Update x^{t+1} = x^t + µ_t ∆x^t

4. Iterate (t → t + 1) until stopping criterion is satisfied

Definition: An iterative method is a descent method if f(x^{t+1}) < f(x^t) ∀t

Recall that for convex f, we have f(x^{t+1}) ≥ f(x^t) + (∇f(x^t))^⊤ (x^{t+1} − x^t). Then:

f(x^{t+1}) < f(x^t) ⇒ the descent direction must satisfy (∇f(x^t))^⊤ ∆x^t < 0

Step size µ_t > 0: constant, exact line search, or backtracking line search

exact line search: µ_t := arg min_{µ>0} f(x^t + µ ∆x^t)
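
A minimal sketch of the generic descent loop above, with an exact line search over a bounded interval (the interval bound and the use of scipy's bounded scalar minimizer are assumptions, not part of the lecture):

    import numpy as np
    from scipy.optimize import minimize_scalar

    def descent(f, grad, direction, x0, tol=1e-8, max_iter=500):
        # direction(x) must return a descent direction: grad(x)^T direction(x) < 0
        x = np.asarray(x0, dtype=float)
        for _ in range(max_iter):
            g = grad(x)
            if g @ g <= tol:                      # stopping criterion on the gradient norm
                break
            d = direction(x)
            # exact line search: mu_t = argmin_{mu > 0} f(x + mu d), restricted to (0, 10]
            mu = minimize_scalar(lambda s: f(x + s * d), bounds=(0.0, 10.0), method='bounded').x
            x = x + mu * d
        return x

    # gradient descent (next slide) is the special case direction(x) = -grad(x)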

Gradient descent

1. Compute search direction ∆x^t = −∇f(x^t) (special case of a descent method)

2. Choose a step size µ_t > 0

3. Update x^{t+1} = x^t + µ_t ∆x^t

4. Iterate until stopping criterion is satisfied

• converges with exact or backtracking line search, or with an upper-bounded (sufficiently small) constant µ

• convergence rate results: c ∈ (0, 1) depends on m, x^0, and the line search

linear for strongly convex f: f(x^t) − p∗ ≤ c^t (f(x^0) − p∗) (illustrated in the sketch below)


sublinear for general convex f: f(x^t) − p∗ ≤ (L/t) (f(x^0) − p∗)

• very simple but typically slow
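
A minimal sketch illustrating the linear rate on a strongly convex quadratic with a constant step size (the quadratic and the step-size rule µ < 1/λ_max(A) are assumptions, chosen to be consistent with the bounded-step remark above):

    import numpy as np

    A = np.diag([1.0, 10.0])                      # f(x) = x^T A x, Hessian 2A (made-up data)
    f = lambda x: x @ A @ x
    mu = 1.0 / (2.0 * np.linalg.eigvalsh(A).max())     # constant step, safely below 1/lambda_max(A)
    x = np.array([10.0, 1.0])
    gaps = []
    for t in range(50):
        gaps.append(f(x))                         # f(x^t) - p*, since p* = 0 here
        x = x - mu * (2 * A @ x)                  # x^{t+1} = x^t - mu * grad f(x^t)
    print(gaps[10] / gaps[9], gaps[40] / gaps[39])     # roughly constant ratio c: linear convergence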

Example

min_x x_1² + M x_2²

where M > 0

• exact line search

• initialize at x0 = (M, 1)

Figure: [Tom Luo’s slides]

• iterates take the form (verified numerically in the sketch below)

x^t = ( M ((M−1)/(M+1))^t , (−(M−1)/(M+1))^t )

• fast convergence when M is close to 1; one step if M = 1!

• slow, zig-zagging convergence if M ≫ 1 or M ≪ 1
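
A minimal numerical check of the closed-form iterates (M = 10 is an arbitrary choice; the exact line-search step below uses the standard formula for a quadratic):

    import numpy as np

    M = 10.0
    A = np.diag([1.0, M])                         # f(x) = x_1^2 + M x_2^2 = x^T A x
    x = np.array([M, 1.0])                        # x^0 = (M, 1)
    r = (M - 1) / (M + 1)
    for t in range(1, 6):
        g = 2 * A @ x                             # gradient
        mu = (g @ g) / (2 * g @ A @ g)            # exact line-search step for this quadratic
        x = x - mu * g
        print(np.allclose(x, [M * r**t, (-r)**t]))     # matches the closed-form iterate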

Example 2

For m = 100 and n = 50, use the gradient method (exact line search) on

min_x c^⊤ x − ∑_{i=1}^{m} log(a_i^⊤ x − b_i)

Figure: Function value convergence for gradient method [Z.-Q. Luo’s slides]
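
A sketch of the function/gradient oracle this example requires (the data A ∈ R^{m×n}, b, c are assumed given; they are not specified on the slide):

    import numpy as np

    def f_and_grad(x, A, b, c):
        # objective c^T x - sum_i log(a_i^T x - b_i); the domain requires Ax > b
        s = A @ x - b                             # slacks a_i^T x - b_i
        val = c @ x - np.sum(np.log(s))
        grad = c - A.T @ (1.0 / s)                # gradient: c - sum_i a_i / (a_i^T x - b_i)
        return val, grad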

Steepest descent direction

The term ∇f(x)^⊤ z gives the approximate decrease in f for small z:

f(x + z) ≈ f(x) + ∇f(x)^⊤ z

Find the direction of steepest descent (SD):

z_sd = arg min_{‖z‖≤1} ∇f(x)^⊤ z

Euclidean norm ‖z‖₂: z_sd = −∇f(x)/‖∇f(x)‖₂ (gradient descent)


Quadratic norm ‖z‖_P := (z^⊤ P z)^{1/2} for some P ≻ 0:

z_sd = −(∇f(x)^⊤ P^{−1} ∇f(x))^{−1/2} P^{−1} ∇f(x)

Equivalent to SD with the Euclidean norm on the transformed variables y = P^{1/2} x
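
A minimal sketch of the quadratic-norm steepest-descent direction above (function and variable names are illustrative, not from the lecture):

    import numpy as np

    def sd_direction(grad, P):
        # normalized steepest-descent direction for the norm ||z||_P = (z^T P z)^(1/2)
        Pinv_g = np.linalg.solve(P, grad)                 # P^{-1} grad, without forming the inverse
        return -Pinv_g / np.sqrt(grad @ Pinv_g)           # scaled so that ||z_sd||_P = 1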

Geometric interpretation

move as far as possible in direction −∇f (x), while staying inside the unit ball

Figure: Boyd’s slides

Choosing the norm

Figure: choice of P strongly affects speed of convergence [Boyd’s slides]

• steepest descent with backtracking line search for two quadratic norms

• ellipses show {x : ‖x − x^t‖_P = 1}

Pure Newton step and interpretations

Newton update: x⁺ = x + v

Newton step: v = −∇²f(x)^{−1} ∇f(x)

• minimizes the second-order expansion of f at x:

f(x) + ∇f(x)^⊤ (x⁺ − x) + ½ (x⁺ − x)^⊤ ∇²f(x) (x⁺ − x)

• solves the linearized optimality condition:

∇f(x) + ∇²f(x)(x⁺ − x) = 0

Figure: [Boyd’s slides]
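
In code, the pure Newton step is just a linear solve (a sketch; solving with the Hessian rather than inverting it is a standard implementation choice, not something stated on the slide):

    import numpy as np

    def newton_step(grad, hess):
        # v solves  hess @ v = -grad, i.e. v = -hess^{-1} grad without forming the inverse
        return np.linalg.solve(hess, -grad)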

Global behavior of Newton iterations

Example: f(x) = log(e^x + e^{−x}), starting at x^0 = −1.1

Figure: pure Newton iterations may diverge! [Z.Q. Luo’s slides]
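
A minimal reproduction of this divergence (f'(x) = tanh(x) and f''(x) = 1/cosh(x)² follow from the given f):

    import numpy as np

    x = -1.1                                      # starting point from the slide
    for t in range(5):
        x = x - np.tanh(x) * np.cosh(x) ** 2      # pure Newton: x+ = x - f'(x)/f''(x)
        print(t + 1, x)                           # |x| grows: the pure Newton iterates diverge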

Newton method

Also called the damped or guarded Newton method (a code sketch follows the bullets below)

1. Compute Newton direction ∆x^t = −(∇²f(x^t))^{−1} ∇f(x^t)


2. Choose step size µ_t

3. Update x^{t+1} = x^t + µ_t ∆x^t

4. Iterate until stopping criterion is satisfied

• global convergence with backtracking or exact line search

• quadratic local convergence

• affine invariance: the Newton iterates for min_x f(x) and min_z f(Tz) with invertible T are equivalent, with x^t = T z^t
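
A minimal sketch of the damped Newton method above with a backtracking line search (the Armijo parameters alpha, beta and the gradient-norm stopping rule are assumptions, not values from the lecture):

    import numpy as np

    def damped_newton(f, grad, hess, x0, tol=1e-10, alpha=0.25, beta=0.5, max_iter=100):
        x = np.asarray(x0, dtype=float)
        for _ in range(max_iter):
            g = grad(x)
            if g @ g <= tol:                      # stop when the gradient is small
                break
            dx = np.linalg.solve(hess(x), -g)     # Newton direction
            mu = 1.0                              # backtracking (Armijo) line search
            while f(x + mu * dx) > f(x) + alpha * mu * (g @ dx):
                mu *= beta
            x = x + mu * dx
        return x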

Convergence results

assumptions: mI ⪯ ∇²f(x) ⪯ MI and the Lipschitz condition

‖∇²f(x) − ∇²f(y)‖ ≤ L ‖x − y‖

1. damped Newton phase (‖∇f(x)‖₂ ≥ η₁): f(x⁺) ≤ f(x) − η₂, hence

#iterations ≤ (f(x^0) − f∗)/η₂

2. quadratically convergent phase (‖∇f(x)‖₂ < η₁):

#iterations ≤ log₂ log₂(η₃/ε)

total #iterations to reach accuracy f(x^t) − f∗ ≤ ε is bounded by:

(f(x^0) − f∗)/η₂ + log₂ log₂(η₃/ε)

η₁, η₂, η₃ depend on m, M, L (a dependence that can be avoided for self-concordant functions)

Example
f(x) = − ∑_{n=1}^{10,000} log(1 − x_n²) − ∑_{i=1}^{100,000} log(b_i − a_i^⊤ x)

Figure: Two-phase convergence of Newton method [Boyd’s slides]

• x ∈ R^{10,000} with sparse a_i's

Minimization with linear equality constraints

Linearly-constrained optimization problem:

min_x { f(x) : Ax = b }

Approach 1: solve the reduced (eliminated) problem

min_z f(Fz + x_0)

where Ax_0 = b and range(F) = null(A)

Approach 2: find the feasible update that minimizes the second-order approximation

∆x := arg min_v  f(x) + ∇f(x)^⊤ v + ½ v^⊤ ∇²f(x) v

        s.to  A(x + v) = b

[Q: How can this be solved?]
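
One standard answer, not spelled out on this slide: the subproblem in Approach 2 is an equality-constrained QP, so its optimality (KKT) conditions form a linear system in the step v and a multiplier w, which can be solved directly. A sketch, assuming x is already feasible (Ax = b):

    import numpy as np

    def eq_newton_step(grad, hess, A):
        # solve  [hess  A^T] [v]   [-grad]
        #        [A      0 ] [w] = [  0  ]    (x feasible, so A v = 0 keeps A(x+v) = b)
        n, p = hess.shape[0], A.shape[0]
        KKT = np.block([[hess, A.T], [A, np.zeros((p, p))]])
        rhs = np.concatenate([-grad, np.zeros(p)])
        sol = np.linalg.solve(KKT, rhs)
        return sol[:n]                            # the step v; sol[n:] holds the multiplier w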

