
Optimization 2

Lecture outline

Unconstrained continuous optimization:


• Convexity
• Iterative optimization algorithms
• Gradient descent
• Newton’s method
• Gauss-Newton method

New topics:
• Axial iteration
• Levenberg-Marquardt algorithm
• Application
Introduction: Problem specification

Suppose we have a cost function (or objective function)

f ! x " # $%n & $%


Our aim is to find the value of the parameters x that minimize this function:

x* = arg min_x f(x)
subject to the following constraints:

• equality: c_i(x) = 0,  i = 1, . . . , m_e

• inequality: c_i(x) ≥ 0,  i = m_e + 1, . . . , m

We will start by focussing on unconstrained problems


Unconstrained optimization
function of one variable f(x)

min_x f(x)

[Figure: a 1-D function f(x) with a local minimum and a global minimum]

• down-hill search (gradient descent) algorithms can find local minima


• which of the minima is found depends on the starting point
• such minima often occur in real applications
Reminder: convexity
Class of functions

[Figure: an example of a convex function and of a non-convex function]

• Convexity provides a test for a single extremum


• A non-negative sum of convex functions is convex
Class of functions continued

[Figure: single extremum – convex; single extremum – non-convex; multiple extrema – non-convex; noisy "horrible" function – non-convex]


Optimization algorithm – key ideas

! "#$% δx &'() * ) + * f , x - δx . < f , x .

! /)#& 0 12+%&0* 3 0 +$0#*24+*#520'6%+*20 x n ! " # 7 0 x n - δx

! 82%'(20*)20643912:0* 3 0 +0&24#2&03;0< = 0 1#$20&2+4()2&0δx 7 α p

[Figure: contour plot of a quadratic cost function ('Optimization algorithm – Random direction')]
Choosing the direction 1: axial iteration

Alternate minimization over x and y

[Figure: alternating minimization along the coordinate axes ('Optimization algorithm – axial directions')]
Gradient and Partial Derivatives

A function of several variables can be written as f (x1 , x2 ), Gradient and Tangent Plane /
etc. Often times, we abbreviate multiple arguments in a 1st Degree Taylor Expansion
single vector as f (x).

Let a function f : Rn → R. The gradient of f is the


column vector of partial derivatives ∇f (x)
 ∂f (x) 
∂x1

∇f (x) :=  .. 
.

 
∂f (x)
∂xn

Suppose now a function g(x, y) with signature g : Rn × τx1 (y) = f (x) + (y − x)> ∇f (x)
Rm → R. Its derivative with respect to just x is written
as ∇x g(x, y).
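For example (an illustration of these definitions, not from the slides): for f(x) = x1² + 3 x1 x2, the gradient is ∇f(x) = (2x1 + 3x2, 3x1)ᵀ, and the first-degree Taylor expansion about x = (1, 1)ᵀ is τ_x¹(y) = 4 + 5(y1 − 1) + 3(y2 − 1).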
Choosing the direction 2: steepest descent

Move in the direction of the negative gradient −∇f(x_n)


[Figure: steepest-descent steps on a quadratic cost ('Optimization algorithm – Steepest descent')]
Steepest descent
[Figure: steepest-descent steps zig-zagging across the contours of a quadratic cost]

• The gradient is everywhere perpendicular to the contour lines.

• After each line minimization the new gradient is always orthogonal to the previous step direction (true of any line minimization).

• Consequently, the iterates tend to zig-zag down the valley in a very inefficient manner.
Gradient Descent

• Iterative method starting at an initial point x(0)


• Step to the next point x(k+1) in the direction of the
negative gradient

x(k+1) = x(k) − ∇f(x(k))

• Repeat until ‖∇f(x(k))‖ < ε for a chosen ε

• But: no convergence is guaranteed. For convergence, an additional line search is required.

[Figure: gradient-descent iterates on the surface of f(x) = ½(x1)² + 5(x2)²]

Line Search

• Take the descent step direction d = −∇f(x)

• Select the step length α as the minimizer min_{α ≥ 0} f(x + αd)

• In practice, α is selected with heuristics
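To make the update and the line-search heuristic concrete, here is a minimal NumPy sketch for the quadratic f(x) = ½(x1)² + 5(x2)² shown in the figure. It is only a sketch under my own assumptions: the function names, the backtracking rule (halving α until f decreases), the tolerance 1e-6 and the starting point are arbitrary choices, not taken from the slides.

    import numpy as np

    def f(x):                                    # the quadratic from the figure
        return 0.5 * x[0]**2 + 5.0 * x[1]**2

    def grad_f(x):                               # its gradient: (x1, 10*x2)
        return np.array([x[0], 10.0 * x[1]])

    def gradient_descent(x0, eps=1e-6, max_iters=1000):
        x = np.asarray(x0, dtype=float)
        for k in range(max_iters):
            g = grad_f(x)
            if np.linalg.norm(g) < eps:          # stop when ||grad f(x)|| < eps
                break
            alpha = 1.0                          # crude backtracking line search:
            while f(x - alpha * g) >= f(x) and alpha > 1e-12:
                alpha *= 0.5                     # halve alpha until the step decreases f
            x = x - alpha * g                    # step along the negative gradient
        return x, k

    x_min, iters = gradient_descent([1.0, 1.0])
    print(x_min, iters)                          # x_min approaches the minimizer (0, 0)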
A harder case: Rosenbrock’s function

f(x, y) = 100 (y − x²)² + (1 − x)²
[Figure: contour plot of the Rosenbrock function]

" # $ # % & % ' #(') * ' +,, ,-


Steepest descent on Rosenbrock function

[Figure: steepest descent on the Rosenbrock function – full view and zoomed view]

• The zig-zag behaviour is clear in the zoomed view (100 iterations)

• The algorithm crawls down the valley


Optimization algorithm – Steepest descent 2
Optimization algorithm – Steepest descent for matrices
Conjugate Gradients – sketch only
! " # $ # % " & ' &( c o n j u g a t e g r a d i e n t s )"&&*#* *+))#**,-# '#*)#.% ',/#)0
%,&.* p n *+)" % " 1 % , % ,* 2+1/1.%##' % & /#1)" %"# $ , . , $ + $ ,. 1 3. ,% #
.+$4#/ &( *%#5*6

7 81)"9 p n ,*9)"&*#.9% & 9 4#9)&.:+21%#9% & 9 1;;95/#-,&+*9*#1/)"9',/#)%,&.*99


< , % " 9 /#*5#)%9% & 9 %"#9=#**,1.9 H>

p!nHp j ? @, @?< j < n

7 ! " # 9 /#*+;%,.29*#1/)"9',/#)%,&.*91/#9$+%+1;;C9;,.#1/;C ,.'#5#.'#.%6

7 RemarkablyD p n )1. 4# )"&*#. +*,.2 &.;C E.&<;#'2# &( p n " # , A f F x n " # G 9 9


1.'9A f F x n G 9 F*##9H+$#/,)1; I#),5#*G

Afn!Afn p
pn ? A f n B n" #
A f n!" # A f n " #
Choosing the direction 3: conjugate gradients

Again, uses first derivatives only, but avoids "undoing" previous work.

• An N-dimensional quadratic form can be minimized in at most N conjugate descent steps.

[Figure: conjugate-gradient minimization of a 2D quadratic from 3 different starting points; the minimum is reached in exactly 2 steps]
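A minimal sketch of the Fletcher-Reeves update described above. It is not the exact algorithm from the slides: the crude backtracking loop below stands in for the exact line minimization that conjugate gradients assumes, and all names and tolerances are my own choices.

    import numpy as np

    def conjugate_gradient(f, grad, x0, eps=1e-6, max_iters=200):
        x = np.asarray(x0, dtype=float)
        g = grad(x)
        p = -g                                   # first direction: steepest descent
        for n in range(max_iters):
            if np.linalg.norm(g) < eps:
                break
            alpha = 1.0                          # stand-in for an exact line minimization
            while f(x + alpha * p) >= f(x) and alpha > 1e-12:
                alpha *= 0.5
            x = x + alpha * p
            g_new = grad(x)
            beta = (g_new @ g_new) / (g @ g)     # Fletcher-Reeves coefficient
            p = -g_new + beta * p                # next search direction
            g = g_new
        return x, n

    # e.g. on the quadratic used earlier:
    x_min, n = conjugate_gradient(lambda x: 0.5 * x[0]**2 + 5.0 * x[1]**2,
                                  lambda x: np.array([x[0], 10.0 * x[1]]),
                                  [1.0, 1.0])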
The Hessian Matrix

Let f : ℝⁿ → ℝ be twice differentiable. Its second (partial) derivatives make up the Hessian matrix ∇²f(x):

∇²f(x) :=  [ ∂²f(x)/∂x1∂x1  · · ·  ∂²f(x)/∂x1∂xn ]
           [       ⋮           ⋱         ⋮       ]
           [ ∂²f(x)/∂xn∂x1  · · ·  ∂²f(x)/∂xn∂xn ]

• The order of differentiation does not matter if the function has continuous second (and higher-order) partial derivatives (Schwarz's theorem).

• Then the Hessian is symmetric: ∇²f(x) = [∇²f(x)]ᵀ

2nd Degree Taylor Expansion:

τ_x²(y) = f(x) + (y − x)ᵀ ∇f(x) + ½ (y − x)ᵀ ∇²f(x) (y − x)
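As a concrete illustration (mine, not from the slides), take the quadratic f(x) = ½(x1)² + 5(x2)² used in the gradient-descent example: ∇f(x) = (x1, 10 x2)ᵀ and ∇²f(x) = [1 0; 0 10], a constant, symmetric, positive-definite matrix, so f is convex with a single minimum at x = (0, 0)ᵀ.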
Choosing the direction 4: Newton’s method
Start from Taylor expansion in 2D
A function may be approximated locally by its Taylor series expansion about a point x₀:

f(x₀ + δx) ≈ f(x₀) + [∂f/∂x, ∂f/∂y] (δx, δy)ᵀ + ½ (δx, δy) [ ∂²f/∂x²  ∂²f/∂x∂y ; ∂²f/∂x∂y  ∂²f/∂y² ] (δx, δy)ᵀ

The expansion to second order is a quadratic function:

f(x + δx) ≈ a + gᵀ δx + ½ δxᵀ H δx

Now minimize this expansion over δx:

min_δx  a + gᵀ δx + ½ δxᵀ H δx

For a minimum we require that ∇f(x + δx) = 0, and so

∇f(x + δx) = g + H δx = 0

with solution δx = −H⁻¹ g  (Matlab: δx = −H\g).
This gives the iterative update

x_{n+1} = x_n − H_n⁻¹ g_n

[Figure: Newton steps on a quadratic cost function]

• If f(x) is quadratic, then the solution is found in one step.

• The method has quadratic convergence (as in the 1D case).

• The solution δx = −H_n⁻¹ g_n is guaranteed to be a downhill direction provided that H is positive definite.

• Rather than jump straight to the predicted solution at x_n − H_n⁻¹ g_n, it is better to perform a line search:

x_{n+1} = x_n − α_n H_n⁻¹ g_n

• If H = I, then this reduces to steepest descent.
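A minimal NumPy sketch of the damped Newton update x_{n+1} = x_n − α_n H_n⁻¹ g_n on the Rosenbrock function, reusing rosenbrock and rosenbrock_grad from the earlier sketch. The Hessian is derived by hand, the backtracking loop is a crude stand-in for a proper line search, and the fall-back to the gradient direction when the Newton step is not downhill (the positive-definiteness issue mentioned above) is my own addition.

    import numpy as np

    def rosenbrock_hess(v):                      # hand-derived Hessian of the Rosenbrock function
        x, y = v
        return np.array([[1200.0 * x**2 - 400.0 * y + 2.0, -400.0 * x],
                         [-400.0 * x,                        200.0]])

    def newton(x0, eps=1e-3, max_iters=100):
        x = np.asarray(x0, dtype=float)
        for n in range(max_iters):
            g = rosenbrock_grad(x)
            if np.linalg.norm(g) < eps:          # same stopping rule as in the figures
                break
            H = rosenbrock_hess(x)
            dx = np.linalg.solve(H, -g)          # Newton step: solve H dx = -g
            if g @ dx >= 0:                      # H not positive definite here:
                dx = -g                          # fall back to steepest descent
            alpha = 1.0
            while rosenbrock(x + alpha * dx) >= rosenbrock(x) and alpha > 1e-12:
                alpha *= 0.5                     # crude backtracking line search
            x = x + alpha * dx
        return x, n

    print(newton([-1.2, 1.0]))                   # iterates approach the minimum at (1, 1)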


Newton’s method - example
[Figure: Newton method with line search on the Rosenbrock function (full and zoomed views); gradient < 1e-3 after 15 iterations. Ellipses show successive quadratic approximations.]

• The algorithm converges in only 15 iterations – far superior to steepest descent.

• However, the method requires computing the Hessian matrix at each iteration – this is not always feasible.
Optimization algorithm – Newton method
Optimization algorithm – Newton2 method
Performance issues for optimization algorithms

1. Number of iterations required

2. Cost per iteration

3. Memory footprint

4. Region of convergence
Non-linear least squares

f(x) = Σ_{i=1}^{M} r_i(x)²

Gradient:

∇f(x) = 2 Σ_i r_i(x) ∇r_i(x)

Hessian:

H = ∇ ∇ᵀ f(x) = 2 Σ_i ∇( r_i(x) ∇ᵀ r_i(x) )
  = 2 Σ_i ( ∇r_i(x) ∇ᵀ r_i(x) + r_i(x) ∇ ∇ᵀ r_i(x) )

which is approximated as

H_GN = 2 Σ_i ∇r_i(x) ∇ᵀ r_i(x)

This is the Gauss-Newton approximation.

x_{n+1} = x_n − α_n H_n⁻¹ g_n   with   H_n = H_GN(x_n)

Gauss-Newton method with line search

[Figure: Gauss-Newton method with line search on the Rosenbrock function (full and zoomed views); gradient < 1e-3 after 14 iterations]

• Minimization with the Gauss-Newton approximation with line search takes only 14 iterations.
Comparison
[Figure: Newton method with line search (gradient < 1e-3 after 15 iterations) and Gauss-Newton method with line search (gradient < 1e-3 after 14 iterations) on the Rosenbrock function]

Newton:
• requires computing the Hessian
• exact solution if f is quadratic

Gauss-Newton:
• approximates the Hessian by products of gradients of the residuals
• requires only first derivatives
Summary of minimization methods

Update: x_{n+1} = x_n + δx

1. Newton:
   H δx = −g

2. Gauss-Newton:
   H_GN δx = −g

3. Gradient descent:
   λ δx = −g
Levenberg-Marquardt algorithm
• Away from the minimum, in regions of negative curvature, the Gauss-Newton approximation is not very good.

• In such regions, a simple steepest-descent step is probably the best plan.

• The Levenberg-Marquardt method is a mechanism for varying between steepest-descent and Gauss-Newton steps depending on how good the H_GN approximation is locally.
[Figure: 1-D illustration comparing the directions of the Newton step and the gradient-descent step]
• The method uses the modified Hessian

H(x, λ) = H_GN + λ I

• When λ is small, H approximates the Gauss-Newton Hessian.

• When λ is large, H is close to the identity, causing steepest-descent steps to be taken.
LM Algorithm
H(x, λ) = H_GN(x) + λ I

1. Set λ = 0.001 (say).

2. Solve δx = −H(x, λ)⁻¹ g.

3. If f(x_n + δx) > f(x_n), increase λ (×10 say) and go to 2.

4. Otherwise, decrease λ (×0.1 say), let x_{n+1} = x_n + δx, and go to 2.

Note: this algorithm does not require explicit line searches.
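A minimal NumPy sketch of these four steps, reusing residuals and jacobian from the Gauss-Newton sketch. The λ = 0.001 start and the ×10 / ×0.1 factors follow the "say" values above; the stopping rule and iteration cap are my own, and each rejected trial counts as one loop iteration.

    import numpy as np

    def levenberg_marquardt(x0, lam=1e-3, eps=1e-3, max_iters=200):
        x = np.asarray(x0, dtype=float)
        for n in range(max_iters):
            r = residuals(x)
            J = jacobian(x)
            g = 2.0 * J.T @ r
            if np.linalg.norm(g) < eps:
                break
            H = 2.0 * J.T @ J + lam * np.eye(len(x))   # modified Hessian H_GN + lambda*I
            dx = np.linalg.solve(H, -g)
            if np.sum(residuals(x + dx)**2) > np.sum(r**2):
                lam *= 10.0                      # worse: behave more like gradient descent
            else:
                lam *= 0.1                       # better: accept the step, move towards Gauss-Newton
                x = x + dx
        return x, n

    print(levenberg_marquardt([-1.2, 1.0]))      # converges to (1, 1) with no line search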
Example

[Figure: Levenberg-Marquardt method on the Rosenbrock function (full and zoomed views); gradient < 1e-3 after 31 iterations]

! "#$#%#&'(#)$*+,#$-*./0/$1/2-3"'24+'25(*6 $ ) *7#$/*,/'289:*(';/,*<=**
#(/2'(#)$,>

Matlab: lsqnonlin
Comparison

[Figure: Gauss-Newton method with line search (gradient < 1e-3 after 14 iterations) and Levenberg-Marquardt method (gradient < 1e-3 after 31 iterations) on the Rosenbrock function]

Levenberg-Marquardt:
• more iterations than Gauss-Newton, but
• no line search is required,
• and it converges more frequently.
