
Om Namo Bhagavate Vasudevaya

23ECE216 Machine Learning

Optimizing functions of many variables

Dr. Binoy B Nair (compiled from Optimization Methods by D Nagesh Kumar, IISc)
Necessary conditions for two-variable optimization

➢ At the stationary points, $\dfrac{\partial f}{\partial x_1} = 0$ and $\dfrac{\partial f}{\partial x_2} = 0$.

➢ i.e. the gradient vector of f(X), $\nabla_x f$, at X = X* = [x1, x2], defined as follows, must equal zero:

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \end{bmatrix} = 0$$

This is the necessary condition.
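As a quick illustration in code (a minimal sketch, assuming SymPy is available; the function f below is an illustrative choice, not one from these slides), the stationary points can be located by solving $\nabla_x f = 0$ symbolically:

```python
# A minimal sketch, assuming SymPy: locate stationary points of an
# illustrative two-variable function by solving grad f = 0.
import sympy as sp

x1, x2 = sp.symbols('x1 x2', real=True)
f = x1**2 + x2**2 - 2*x1 - 4*x2            # illustrative f, not from the slides
grad = [sp.diff(f, v) for v in (x1, x2)]   # [df/dx1, df/dx2]
print(sp.solve(grad, [x1, x2], dict=True)) # [{x1: 1, x2: 2}]
```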


Sufficient conditions
➢ Consider the following second-order derivatives:

$$\frac{\partial^2 f}{\partial x_1^2};\quad \frac{\partial^2 f}{\partial x_2^2};\quad \frac{\partial^2 f}{\partial x_1 \partial x_2}$$

➢ The Hessian matrix H is built from these second-order derivatives:

$$\mathbf{H} = \begin{bmatrix} \dfrac{\partial^2 f}{\partial x_1^2} & \dfrac{\partial^2 f}{\partial x_1 \partial x_2} \\[2mm] \dfrac{\partial^2 f}{\partial x_1 \partial x_2} & \dfrac{\partial^2 f}{\partial x_2^2} \end{bmatrix}_{[x_1, x_2]}$$
Sufficient conditions …contd.
➢ The definiteness of H at the stationary point is then examined:
➢ if H is positive definite, the point X = [x1, x2] is a point of local minimum;
➢ if H is negative definite, the point X = [x1, x2] is a point of local maximum;
➢ if H is neither, the point X = [x1, x2] is neither a point of maximum nor minimum.
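A minimal sketch of this classification rule, assuming NumPy (the helper name `classify` is hypothetical):

```python
# A sketch, assuming NumPy: classify a stationary point from the
# eigenvalues of its (symmetric) Hessian.
import numpy as np

def classify(H):
    eig = np.linalg.eigvalsh(H)  # eigvalsh: eigenvalues of a symmetric matrix
    if np.all(eig > 0):
        return "local minimum (H positive definite)"
    if np.all(eig < 0):
        return "local maximum (H negative definite)"
    return "neither a maximum nor a minimum"

print(classify(np.array([[2.0, 0.0], [0.0, 3.0]])))    # local minimum
print(classify(np.array([[4.0, -2.0], [-2.0, -1.0]]))) # neither
```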

Reminder
• A square matrix A is called positive definite if it is symmetric (i.e. $A^T = A$) and all its eigenvalues λ are positive, that is λ > 0.

• A square matrix A is called negative definite if it is symmetric (i.e. $A^T = A$) and all its eigenvalues λ are negative, that is λ < 0.
Example
Consider the function $f(\mathbf{X}) = 2x_1^3/3 - 2x_1 x_2 - 5x_1 + 2x_2^2 + 4x_2 + 5$.
Locate the stationary points of f(X) and classify them as relative maxima, relative minima or neither.

Solution

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \end{bmatrix} = \begin{bmatrix} 2x_1^2 - 2x_2 - 5 \\ -2x_1 + 4x_2 + 4 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$
Example …contd.
Solution contd..

From $\partial f/\partial x_2 = 0$, $x_1 = 2x_2 + 2$. Substituting this into $\partial f/\partial x_1 = 0$ gives

$$8x_2^2 + 14x_2 + 3 = 0$$
$$(2x_2 + 3)(4x_2 + 1) = 0$$

so $x_2 = -3/2$ or $x_2 = -1/4$.

Substituting these values back into $x_1 = 2x_2 + 2$ gives the corresponding values $x_1 = -1$ and $x_1 = 3/2$.
So the two stationary points are
X1 = [-1, -3/2] and X2 = [3/2, -1/4]
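A quick cross-check of these roots, assuming SymPy is available:

```python
# Cross-check, assuming SymPy: solve the two stationary-point
# equations from this example simultaneously.
import sympy as sp

x1, x2 = sp.symbols('x1 x2', real=True)
eqs = [2*x1**2 - 2*x2 - 5, -2*x1 + 4*x2 + 4]
print(sp.solve(eqs, [x1, x2]))  # [(-1, -3/2), (3/2, -1/4)]
```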

Example …contd.
Solution contd..

$$\frac{\partial^2 f}{\partial x_1^2} = 4x_1;\quad \frac{\partial^2 f}{\partial x_2^2} = 4;\quad \frac{\partial^2 f}{\partial x_1 \partial x_2} = \frac{\partial^2 f}{\partial x_2 \partial x_1} = -2$$

The Hessian of f(X) is

$$\mathbf{H} = \begin{bmatrix} 4x_1 & -2 \\ -2 & 4 \end{bmatrix}$$

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda - 4x_1 & 2 \\ 2 & \lambda - 4 \end{vmatrix}$$

At X1 = [-1, -3/2],

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda + 4 & 2 \\ 2 & \lambda - 4 \end{vmatrix} = (\lambda + 4)(\lambda - 4) - 4 = \lambda^2 - 20 = 0$$

$$\lambda_1 = +\sqrt{20},\quad \lambda_2 = -\sqrt{20}$$

Since one eigenvalue is positive and one negative, X1 is neither a relative maximum nor a relative minimum.
Example …contd.
Solution contd..

At X2 = [3/2, -1/4],

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda - 6 & 2 \\ 2 & \lambda - 4 \end{vmatrix} = (\lambda - 6)(\lambda - 4) - 4 = 0$$

$$\lambda_1 = 5 + \sqrt{5},\quad \lambda_2 = 5 - \sqrt{5}$$

Since both eigenvalues are positive, X2 is a local minimum.

The minimum value of f(X) is f(X2) = -0.375.
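These results can be confirmed numerically, assuming NumPy:

```python
# Numerical confirmation, assuming NumPy: eigenvalues of H at each
# stationary point, and the value of f at the minimum.
import numpy as np

f = lambda x1, x2: 2*x1**3/3 - 2*x1*x2 - 5*x1 + 2*x2**2 + 4*x2 + 5
H = lambda x1: np.array([[4*x1, -2.0], [-2.0, 4.0]])

print(np.linalg.eigvalsh(H(-1.0)))  # approx [-4.47, 4.47]: saddle at X1
print(np.linalg.eigvalsh(H(1.5)))   # approx [2.76, 7.24]: minimum at X2
print(f(1.5, -0.25))                # -0.375
```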

Example
Maximize $f(\mathbf{X}) = 20 + 2x_1 - x_1^2 + 6x_2 - 3x_2^2/2$

Solution

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \end{bmatrix} = \begin{bmatrix} 2 - 2x_1 \\ 6 - 3x_2 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix} \quad\Rightarrow\quad X^* = [1, 2]$$

$$\frac{\partial^2 f}{\partial x_1^2} = -2;\quad \frac{\partial^2 f}{\partial x_2^2} = -3;\quad \frac{\partial^2 f}{\partial x_1 \partial x_2} = 0;\qquad \mathbf{H} = \begin{bmatrix} -2 & 0 \\ 0 & -3 \end{bmatrix}$$
Example …contd.

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda + 2 & 0 \\ 0 & \lambda + 3 \end{vmatrix} = (\lambda + 2)(\lambda + 3) = 0$$

$$\lambda_1 = -2 \quad\text{and}\quad \lambda_2 = -3$$

Since both eigenvalues are negative (and H is constant, so this holds everywhere), f(X) is concave with a global maximum of f(X*) = 27.
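A quick numerical check, assuming NumPy:

```python
# Quick check, assuming NumPy: H is constant here, so negative
# eigenvalues everywhere give a global maximum.
import numpy as np

f = lambda x1, x2: 20 + 2*x1 - x1**2 + 6*x2 - 3*x2**2/2
H = np.array([[-2.0, 0.0], [0.0, -3.0]])
print(np.linalg.eigvalsh(H))  # [-3., -2.]: negative definite
print(f(1.0, 2.0))            # 27.0
```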

Functions of two variables
• A function of two variables, f(X), where X is the vector [x1, x2], is strictly convex if

$$f(t\mathbf{X}_1 + (1 - t)\mathbf{X}_2) < t\, f(\mathbf{X}_1) + (1 - t)\, f(\mathbf{X}_2)$$

• for any two distinct points X1 and X2 (located by the coordinates in their respective vectors) and any t with 0 < t < 1.

• Similarly, a two-variable function is strictly concave if

$$f(t\mathbf{X}_1 + (1 - t)\mathbf{X}_2) > t\, f(\mathbf{X}_1) + (1 - t)\, f(\mathbf{X}_2)$$
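A minimal sketch of the strict-convexity inequality along one segment, assuming NumPy (the quadratic f below is an illustrative strictly convex choice, not one from the slides):

```python
# Sketch, assuming NumPy: for a strictly convex f, the chord lies
# strictly above the function between any two points X1 and X2.
import numpy as np

f = lambda X: X[0]**2 + X[1]**2  # illustrative strictly convex function
X1, X2 = np.array([0.0, 1.0]), np.array([2.0, -1.0])
for t in (0.25, 0.5, 0.75):
    lhs = f(t*X1 + (1 - t)*X2)
    rhs = t*f(X1) + (1 - t)*f(X2)
    print(t, lhs < rhs)  # True at every t in (0, 1)
```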

Contour plot of a convex function (figure omitted)

Contour plot of a concave function (figure omitted)
Sufficient conditions
• To determine the convexity or concavity of a function of multiple variables, the eigenvalues of its Hessian matrix are examined, and the following rules apply:
• If all eigenvalues of the Hessian are positive, the function is strictly convex.
• If all eigenvalues of the Hessian are negative, the function is strictly concave.
• If some eigenvalues are positive and some are negative, or if some are zero, the function is neither strictly concave nor strictly convex.

Example
Locate the stationary points of f(X) and find out if the function is convex, concave or neither at the points of optima.

$$f(\mathbf{X}) = 2x_1^3/3 - 2x_1 x_2 - 5x_1 + 2x_2^2 + 4x_2 + 5$$

Solution:

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \end{bmatrix} = \begin{bmatrix} 2x_1^2 - 2x_2 - 5 \\ -2x_1 + 4x_2 + 4 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \end{bmatrix}$$

As before, the stationary points are $X_1 = [-1, -3/2]$ and $X_2 = [3/2, -1/4]$.
The Hessian is calculated as follows:

$$\frac{\partial^2 f}{\partial x_1^2} = 4x_1,\quad \frac{\partial^2 f}{\partial x_2^2} = 4,\quad \frac{\partial^2 f}{\partial x_1 \partial x_2} = \frac{\partial^2 f}{\partial x_2 \partial x_1} = -2$$

$$\mathbf{H} = \begin{bmatrix} 4x_1 & -2 \\ -2 & 4 \end{bmatrix}$$

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda - 4x_1 & 2 \\ 2 & \lambda - 4 \end{vmatrix} = 0$$

i.e. at $X_1$:

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda + 4 & 2 \\ 2 & \lambda - 4 \end{vmatrix} = (\lambda + 4)(\lambda - 4) - 2 \cdot 2 = 0$$
$$\lambda^2 - 16 - 4 = 0$$
$$\lambda^2 = 20,\quad \text{i.e. } \lambda = +\sqrt{20},\ -\sqrt{20}$$

Since one eigenvalue is positive and another negative, the point $X_1$ is a saddle point.
Example (contd..)

i.e. at $X_2$:

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda - 6 & 2 \\ 2 & \lambda - 4 \end{vmatrix} = (\lambda - 6)(\lambda - 4) - 2 \cdot 2 = 0$$
$$\lambda^2 - 10\lambda + 24 - 4 = 0$$
$$\lambda^2 - 10\lambda + 20 = 0$$
$$\lambda = 5 + \sqrt{5},\ 5 - \sqrt{5}$$

Since both eigenvalues are positive, the point $X_2$ is a local minimum and the function is convex at this point.
Necessary condition

• In the case of multivariable functions, a necessary condition for a stationary point of the function f(X) is that each partial derivative is equal to zero.
• In other words, each element of the gradient vector defined below must be equal to zero, i.e. the gradient vector of f(X), $\nabla_x f$ at X = X*, defined as follows, must equal zero:

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \\ \vdots \\ \dfrac{\partial f}{\partial x_n}(X^*) \end{bmatrix} = 0$$
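As a sketch of this condition for general n, assuming NumPy (the helper `grad` is hypothetical and uses central differences rather than symbolic derivatives):

```python
# Sketch, assuming NumPy: central-difference gradient for an
# n-variable f; at a stationary point it should be near zero.
import numpy as np

def grad(f, x, h=1e-6):
    g = np.zeros_like(x)
    for i in range(len(x)):
        e = np.zeros_like(x)
        e[i] = h
        g[i] = (f(x + e) - f(x - e)) / (2 * h)
    return g

f = lambda x: x[0]**2 + 2*x[1]**2 + 3*x[2]**2  # illustrative f
print(grad(f, np.array([0.0, 0.0, 0.0])))      # approx [0, 0, 0]
```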
Sufficient condition
➢ For a stationary point X* to be an extreme point, the matrix of second partial derivatives (the Hessian matrix) of f(X) evaluated at X* must be:
➢ positive definite when X* is a point of relative minimum, and
➢ negative definite when X* is a relative maximum point.

➢ When all eigenvalues are negative for all possible values of X, then X* is a global maximum, and when all eigenvalues are positive for all possible values of X, then X* is a global minimum.

➢ If some of the eigenvalues of the Hessian at X* are positive and some negative, or if some are zero, the stationary point X* is neither a local maximum nor a local minimum.
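Continuing the finite-difference sketch above, assuming NumPy (the helper `hessian` is hypothetical): its eigenvalues at a stationary point supply this sufficient condition.

```python
# Sketch, assuming NumPy: central-difference Hessian; its eigenvalues
# at a stationary point give the sufficient condition.
import numpy as np

def hessian(f, x, h=1e-4):
    n = len(x)
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            ei = np.zeros(n); ei[i] = h
            ej = np.zeros(n); ej[j] = h
            H[i, j] = (f(x + ei + ej) - f(x + ei - ej)
                       - f(x - ei + ej) + f(x - ei - ej)) / (4 * h * h)
    return H

f = lambda x: x[0]**2 + 2*x[1]**2                    # illustrative f
print(np.linalg.eigvalsh(hessian(f, np.zeros(2))))   # approx [2., 4.]
```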

Example
Analyze the function $f(\mathbf{X}) = -x_1^2 - x_2^2 - x_3^2 + 2x_1 x_2 + 2x_1 x_3 + 4x_1 - 5x_3 + 2$ and classify the stationary points as maxima, minima and points of inflection.

Solution

$$\nabla_x f = \begin{bmatrix} \dfrac{\partial f}{\partial x_1}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_2}(X^*) \\[2mm] \dfrac{\partial f}{\partial x_3}(X^*) \end{bmatrix} = \begin{bmatrix} -2x_1 + 2x_2 + 2x_3 + 4 \\ -2x_2 + 2x_1 \\ -2x_3 + 2x_1 - 5 \end{bmatrix} = \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix}$$
Example …contd.

Solving the system: the second equation gives $x_2 = x_1$ and the third gives $x_3 = x_1 - 5/2$. Substituting both into the first equation, $-2x_1 + 2x_1 + 2(x_1 - 5/2) + 4 = 2x_1 - 1 = 0$, so $x_1 = 1/2$.

The stationary point is therefore $X^* = [1/2,\ 1/2,\ -2]$.
Example …contd.
Hessian of f(X) is:

$$\mathbf{H} = \left[ \frac{\partial^2 f}{\partial x_i \partial x_j} \right]$$

$$\mathbf{H} = \begin{bmatrix} -2 & 2 & 2 \\ 2 & -2 & 0 \\ 2 & 0 & -2 \end{bmatrix}$$

$$|\lambda \mathbf{I} - \mathbf{H}| = \begin{vmatrix} \lambda + 2 & -2 & -2 \\ -2 & \lambda + 2 & 0 \\ -2 & 0 & \lambda + 2 \end{vmatrix} = 0$$

This factors as $(\lambda + 2)\left[(\lambda + 2)^2 - 8\right] = 0$, giving $\lambda = -2,\ -2 + 2\sqrt{2},\ -2 - 2\sqrt{2}$. Since the eigenvalues have mixed signs, X* is a saddle point.
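The eigenvalues can be verified numerically, assuming NumPy:

```python
# Verification, assuming NumPy: the Hessian is constant, so its
# eigenvalues decide the nature of X* directly.
import numpy as np

H = np.array([[-2.0,  2.0,  2.0],
              [ 2.0, -2.0,  0.0],
              [ 2.0,  0.0, -2.0]])
print(np.linalg.eigvalsh(H))  # approx [-4.83, -2., 0.83]: mixed signs, saddle
```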

Thank you
