2 - Non Linear Optimization - V3
Samih Abdul-Nabi
P:  Maximize f(x)
    Subject to x ∈ Ω

With x an n-vector of independent variables: x = [x1, x2, ..., xn]^T ∈ ℝⁿ; these are the decision
variables of the problem. The set Ω is the set of feasible solutions. A feasible solution is a
solution that satisfies all the constraints of the problem. So the problem P can be written as:

P:  Maximize f(x)
    Subject to gi(x) ≤ bi   for i = 1, 2, ..., m
               x ≥ 0
The function f : ℝⁿ → ℝ that we wish to maximize is a real-valued function called the objective
function or cost function.
P is a decision problem where we are asked to find the best vector x among all possibilities in Ω.
Also note that we might be asked to minimize rather than maximize; in fact, minimizing f is the
same as maximizing −f.
NOTES:
• If a variable xi is restricted to be negative, a change of variable xi = −x'i is needed, with x'i ≥ 0.
• If a variable xi is not restricted (it can be positive or negative), a change of variable
  xi = x'i − x''i is needed, with x'i ≥ 0 and x''i ≥ 0.
• If a constraint is of the ≥ type, it can be multiplied by −1 to bring it to the standard ≤ form
  (a short illustration of these transformations follows below).
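As a small illustration of these transformations (this example is not in the original notes),
consider maximizing f(x1, x2) = 3 x1 + x2 subject to x1 + x2 ≥ 2, with x1 ≥ 0 and x2 unrestricted in
sign. Multiplying the constraint by −1 gives −x1 − x2 ≤ −2, and writing x2 = x'2 − x''2 with x'2 ≥ 0
and x''2 ≥ 0 puts the problem in the standard form:

    Maximize 3 x1 + x'2 − x''2
    Subject to −x1 − x'2 + x''2 ≤ −2
               x1, x'2, x''2 ≥ 0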
1.1 Neighbors
The neighbors of x, identified as N(x), form the set of points defined as:
N(x) = {y ∈ Ω : ||x − y|| < ε}
with ε > 0 a very small number and ||x − y|| the Euclidean distance between x and y.
1.3 Global maximizer
Definition: x* is a global maximizer if f(x*) ≥ f(x) for any x ∈ Ω.
Meaning that the value of f(x*) is the highest among all x in the feasible domain.
If we have f(x*) > f(x) for any x ∈ Ω with x ≠ x*, then x* is a strict global maximizer.
Figure 1-1 shows some local and global maxima. X1 is a local maximum (not strict) while X2 is a
strict local maximum and X3 is a strict global maximum.
The major difficulty of nonlinear optimization is that algorithms are unable to easily
differentiate between local and global maximizers.
1.4 Level sets
The level set of a function f : ℝⁿ → ℝ at level c is the set of points S = {x ∈ Ω : f(x) = c}.
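For a quick illustration (not part of the original notes): for f(x1, x2) = x1² + x2², the level set at
level c > 0 is S = {x : x1² + x2² = c}, a circle of radius √c centered at the origin; higher levels
correspond to larger circles.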
Figure 1-2 shows a nonlinear optimization problem. The feasible domain defined by the linear
constraints is shown in gray.
To draw the domain:
• Consider the first constraint
- Draw the curve defined by the corresponding equality (x1 = 4).
  This line divides the plane into two parts. The set of points satisfying x1 ≤ 4 is one of
  these parts.
- Test any point, say (0, 0), to see which part is the feasible part. In our case, we
  have 0 ≤ 4, therefore the part containing the point (0, 0) belongs to the feasible
  domain.
• We repeat the same exercise for the four constraints (including constraints on the sign
of the decision variables).
Figure 1-2 also shows some level sets of Z. Note that in this case the highest value of Z is
reached at the point (3, 3), which is the global maximizer of Z.
1.5 Boundary points
A point x ∈ Ω is said to be a boundary point if every neighborhood of x has a point in Ω and a
point outside Ω. The set of all boundary points of Ω is called the boundary of Ω.
Note that a feasible domain might not have any boundary points; in that case, every point of the
set is an interior point. For example, the open interval ]-1, 1[ has no boundary points.
1.6 Search direction
Definition: A vector d ∈ ℝⁿ with d ≠ 0 (at least one component is not 0) is a feasible direction at
x ∈ Ω if there exists α0 > 0 such that x + αd ∈ Ω for all α ∈ [0, α0].
Figure 1-4 shows d1 as a feasible direction while d2 is not a feasible direction from x. Note that x
is on the boundary of the feasible domain.
The directional derivative of the function f in the direction d is a real-valued function denoted
by ∂f/∂d, defined as:
∂f/∂d (x) = lim (α→0) [f(x + αd) − f(x)] / α = ⟨∇f(x), d⟩ = d^T ∇f(x)
If ||d|| = 1 (d is a unit vector), then ∂f/∂d (x) is the rate of increase of the function f at x in the
direction of d.
Example: consider f : ℝ³ → ℝ with f(x) = x1 x2 x3, and let d = [1/2, 1/2, 1/√2]^T (a unit vector).
The directional derivative of the function f in the direction d is:
∂f/∂d (x) = ∇f(x)^T d = [x2 x3, x1 x3, x1 x2] [1/2, 1/2, 1/√2]^T = (x2 x3 + x1 x3 + √2 x1 x2) / 2
At x = (1, 1, 2) this gives (2 + 2 + √2)/2 ≈ 2.71. Note that because d is a unit vector, this value is
the rate of increase of the function f at x in the direction of d.
The directional derivative gives an indication about what will happen to the function if we move
in the direction of d starting from x. If the directional derivative is > 0, then the value of the
function f will increase if we move in that direction. The maximum increase is found in the
direction of the gradient: taking d = ∇f(x), the directional derivative equals ∇f(x)^T ∇f(x) = ||∇f(x)||².
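The following short Python sketch (not part of the original notes; it only assumes NumPy is
available) checks the directional derivative of the example above numerically and illustrates that
the largest rate of increase over all unit directions is obtained along the gradient:

import numpy as np

def f(x):
    # f(x) = x1 * x2 * x3, the function used in the example above
    return x[0] * x[1] * x[2]

def grad_f(x):
    # Analytic gradient: (x2*x3, x1*x3, x1*x2)
    return np.array([x[1] * x[2], x[0] * x[2], x[0] * x[1]])

def directional_derivative(x, d, h=1e-6):
    # Finite-difference approximation of lim_{a->0} [f(x + a*d) - f(x)] / a
    return (f(x + h * d) - f(x)) / h

x = np.array([1.0, 1.0, 2.0])
d = np.array([0.5, 0.5, 1.0 / np.sqrt(2)])    # the unit vector d of the example
print(directional_derivative(x, d))           # ~ 2.707 = (2 + 2 + sqrt(2)) / 2
print(grad_f(x) @ d)                          # exact value d^T grad f(x)

g = grad_f(x)
print(g @ (g / np.linalg.norm(g)))            # rate of increase along the gradient = ||grad f(x)||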
Figure 1-5 shows the directional derivative at two border points. First, at x1, if we move in the
direction of d1 the value of the function f will decrease since d1^T ∇f(x1) < 0 (the angle between
d1 and the gradient is larger than 90°). From x2, moving in the direction of the gradient will
increase the value of the function f.
1.7 Types of functions
1.7.1 Convex sets
A set Ω ⊂ ℝⁿ is called convex if, for any two points x' and x'' ∈ Ω, the line segment joining x' and
x'' completely belongs to Ω.
In other words: the point x = (1 − t)x' + tx'' is also in Ω for every t ∈ [0, 1].
Definition: A function f : Ω ⊂ ℝⁿ → ℝ defined on a convex set Ω is concave if, given any two points
x' and x'' ∈ Ω, we have: (1 − t)·f(x') + t·f(x'') ≤ f((1 − t)·x' + t·x'') for every t ∈ [0, 1].
Definition: A function f : Ω ⊂ ℝⁿ → ℝ defined on a convex set Ω is convex if, given any two points
x' and x'' ∈ Ω, we have: (1 − t)·f(x') + t·f(x'') ≥ f((1 − t)·x' + t·x'') for every t ∈ [0, 1].
Figure 1-7 shows a convex and a concave function. A concave function is said to be curved
down, while a convex function is said to be curved up.
The summation of concave functions is also a concave function. Similarly, the summation of
convex functions is also a convex function.
Note: when we maximize a function of a single variable without any constraint, knowing that the
function is concave guarantees that a local maximizer is also a global maximizer. This guarantee
can be given when:
∂²f/∂x² ≤ 0
Similarly, a local minimizer of a convex function of a single variable without any constraint is
also a global minimizer if:
∂²f/∂x² ≥ 0
A necessary condition for a solution x to be optimal, when f(x) is a differentiable function, is:
∂f/∂xj = 0   for j = 1, 2, ..., n
Note: when f(x) is a concave function the condition is also sufficient.
2.2 One variable unconstrained optimization
Conditions:
• n=1 (one variable)
• the function is concave
Therefore, the necessary and sufficient condition for a particular solution x = x* to be
optimal (a global maximum) is:
∂f/∂x = 0 at x = x*
NOTE: the function might not be that easy to differentiate (or the resulting equation easy to
solve). For that reason, different methods exist to find a local maximizer of such a function.
NOTE: even when differentiating the whole function is very hard, we can still obtain the value of
the derivative at a specific value of the variable.
Figure 2-1: Newton's method for a one-variable unconstrained function (the curve f(x) with the
successive iterates x0, x1, x2, x3).
Starting from x0, we approximate f(x) by a quadratic function and we find its maximum (x1). We
then approximate f(x) at x1 to find x2, then x3. When the difference between xk-1 and xk falls
below a threshold, we stop.
The approximation of the function is obtained using a Taylor series as follows:
p(x) = f(xi) + f'(xi)(x − xi) + [f''(xi)/2](x − xi)²
Note that p(x) is similar to f(x) in the neighborhood of xi.
Differentiating p and setting the derivative to 0 to find the maximum, we get:
p'(x) = f'(xi) + 2·[f''(xi)/2](x − xi) = f'(xi) + f''(xi)(x − xi)
f'(xi) + f''(xi)(x* − xi) = 0
x* = xi − f'(xi)/f''(xi), and then we take xi+1 = x*.
Newton's method therefore moves from xi in the direction of (−f'(xi)/f''(xi)).
In general, a move is written as x = xi + α·d (a step of size α in the direction d).
The method (a short code sketch follows these steps):
Step 0. Find an initial solution x0 and set i = 0.
Step 1. Compute f'(xi) and f''(xi).
Step 2. Set xi+1 = xi − f'(xi)/f''(xi).
Step 3. If |xi+1 − xi| < ε then stop; else set i = i + 1 and go to Step 1.
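A minimal Python sketch of these steps (not part of the original notes; the test function and the
tolerance are illustrative), assuming we can evaluate f'(x) and f''(x) at any point:

def newton_1d(fprime, fsecond, x0, eps=1e-6, max_iter=100):
    # One-variable Newton method as described in Steps 0-3 above.
    x = x0
    for _ in range(max_iter):
        x_new = x - fprime(x) / fsecond(x)   # Step 2: x_{i+1} = x_i - f'(x_i)/f''(x_i)
        if abs(x_new - x) < eps:             # Step 3: stopping test
            return x_new
        x = x_new
    return x

# Illustrative example: maximize f(x) = 4x - x^2 - x^4.
# f'(x) = 4 - 2x - 4x^3 and f''(x) = -2 - 12x^2 < 0 everywhere, so f is concave.
fp = lambda x: 4 - 2 * x - 4 * x ** 3
fpp = lambda x: -2 - 12 * x ** 2
x_star = newton_1d(fp, fpp, x0=1.0)
print(x_star, fp(x_star))                    # f'(x*) is ~0 at the maximizer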
2.3.1 Gradient search
In this context, the objective function is assumed to be differentiable, thus it has a gradient
∇f(x) = (∂f/∂x1, ∂f/∂x2, ..., ∂f/∂xn). As we said before, the direction of the gradient is the one that
increases the objective function the most.
Starting with an initial point X0, we move in the direction of the gradient. So let X1 be the next
point to reach from X0 in the direction of the gradient: X1 = X0 + α∇f(X0).
Note that replacing X by X1 in the objective function (that we need to maximize) gives a
function f(α) with only α as variable. In this case we are back to a one-variable unconstrained
function that we know how to maximize.
The question is: what is the best value of α, the one giving the new point X1 that maximizes f(X1)
starting from X0? This value can be found by maximizing f(α).
2.3.1.2 Example
Consider the following two-variable problem:
f(x) = 2 x1 x2 + 2 x2 − x1² − 2 x2²
The gradient is ∇f(x) = (2 x2 − 2 x1, 2 x1 + 2 − 4 x2)^T.
To apply the gradient search algorithm, let us start with (0, 0); we have ∇f(0, 0) = (0, 2)^T.
Iteration 1: the new point is X1 = X0 + t·∇f(X0) = (0, 0)^T + t·(0, 2)^T = (0, 2t)^T
f(0, 2t) = 4t − 8t²; this function has its maximum for t = 1/4 (by setting the derivative to 0).
Thus the new point is (0, 1/2), and
∇f(0, 1/2) = (1, 0)^T
Iteration 2: the new point is X2 = X1 + t·∇f(X1) = (0, 1/2)^T + t·(1, 0)^T = (t, 1/2)^T
f(t, 1/2) = t − t² + 1/2; this function has its maximum for t = 1/2 (by setting the derivative to 0).
Thus the new point is (1/2, 1/2), and
∇f(1/2, 1/2) = (0, 1)^T
And so on until the stopping criterion is met, considering for example that we stop when the
absolute value of each partial derivative is at most 0.1. Figure 2-2 shows how the gradient
method moves from one solution to another until it converges towards the optimal solution.
The optimal solution reached is a global maximum since the function is concave. If this were not
the case, the solution would simply be a local maximum.
Figure 2-3 shows another illustration of how the gradient method works. The method starts at
X(0) and moves in the direction of the gradient, increasing the objective value as much as
possible. This increase ends when the search direction becomes tangent to a level set, at X(1).
Starting from X(1) we move again in the direction of the gradient (which is perpendicular to the
tangent to the level set) and so on.
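The following Python sketch (illustrative only, not the course's code; it assumes NumPy and uses
a crude grid line search in place of an exact one) reproduces the gradient search on the example
f(x) = 2 x1 x2 + 2 x2 − x1² − 2 x2²:

import numpy as np

def f(x):
    return 2 * x[0] * x[1] + 2 * x[1] - x[0] ** 2 - 2 * x[1] ** 2

def grad_f(x):
    return np.array([2 * x[1] - 2 * x[0], 2 * x[0] + 2 - 4 * x[1]])

x = np.array([0.0, 0.0])                  # starting point X(0)
while np.max(np.abs(grad_f(x))) > 0.1:    # stop when each |partial derivative| <= 0.1
    d = grad_f(x)
    # Crude line search: evaluate f(x + t*d) on a grid of t values and keep the best one.
    ts = np.linspace(0.0, 1.0, 1001)
    t_best = ts[np.argmax([f(x + t * d) for t in ts])]
    x = x + t_best * d
    print(x, f(x))
# The iterates (0, 0.5), (0.5, 0.5), (0.5, 0.75), ... converge towards (1, 1), where f = 1.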
2.3.2 Newton
We saw in 2.2.1 how to use Newton's method to solve a one-variable unconstrained problem. The
same concept and the same approximation are used in the case of multi-variable unconstrained
optimization. However, since we are now in a higher dimension, the update xi+1 = xi − f'(xi)/f''(xi)
becomes:
x(k+1) = x(k) − F(x(k))⁻¹ ∇f(x(k))
where F(x(k)) is the Hessian matrix of f evaluated at x(k).
2.3.2.1 Example
Consider the function f(x1, x2, x3, x4) = −(x1 + 10 x2)² − 5(x3 − x4)² − (x2 − 2 x3)⁴ − 10(x1 − x4)⁴.
In order to apply the Newton method, we select x(0) = [3, −1, 0, 1]^T as starting point. This gives
f(x(0)) = −215. Before starting with the method, let us compute the gradient and the Hessian.
∇f(x) = [ −2(x1 + 10 x2) − 40(x1 − x4)³,
          −20(x1 + 10 x2) − 4(x2 − 2 x3)³,
          −10(x3 − x4) + 8(x2 − 2 x3)³,
           10(x3 − x4) + 40(x1 − x4)³ ]^T
Iteration 1:
∇f(x(0)) = [−306, 144, 2, 310]^T
x(1) = x(0) − F(x(0))⁻¹ ∇f(x(0))
The following iterate is then obtained in the same way, x(2) = x(1) − F(x(1))⁻¹ ∇f(x(1)), and so on.
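As a sketch (not from the notes), the following Python code carries out the first Newton step of
this example numerically, using the analytic gradient above and a finite-difference approximation
of the Hessian F(x):

import numpy as np

def grad_f(x):
    x1, x2, x3, x4 = x
    return np.array([
        -2 * (x1 + 10 * x2) - 40 * (x1 - x4) ** 3,
        -20 * (x1 + 10 * x2) - 4 * (x2 - 2 * x3) ** 3,
        -10 * (x3 - x4) + 8 * (x2 - 2 * x3) ** 3,
        10 * (x3 - x4) + 40 * (x1 - x4) ** 3,
    ])

def hessian(x, h=1e-5):
    # Approximate F(x) column by column: dgrad/dx_j ~ [grad(x + h e_j) - grad(x - h e_j)] / (2h)
    n = len(x)
    H = np.zeros((n, n))
    for j in range(n):
        e = np.zeros(n); e[j] = h
        H[:, j] = (grad_f(x + e) - grad_f(x - e)) / (2 * h)
    return H

x0 = np.array([3.0, -1.0, 0.0, 1.0])
g = grad_f(x0)                              # [-306, 144, 2, 310]
x1 = x0 - np.linalg.solve(hessian(x0), g)   # Newton step x(1) = x(0) - F(x(0))^-1 grad f(x(0))
print(x1)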
The method starts by formulating the Lagrangian function F(X, λ) = f(X) − Σ_{i=1}^{m} λi [gi(X) − bi],
where the λi, i = 1, ..., m, are the Lagrange multipliers associated with the m constraints.
2.4.1.1 Example
Consider the problem
Maximize f(x1, x2) = x1² + 2 x2
Subject to:
g(x1, x2) = x1² + x2² = 1
The corresponding Lagrangian function is F(x1, x2, λ) = x1² + 2 x2 − λ(x1² + x2² − 1) and the partial
derivatives are:
∂F/∂x1 = 2 x1 − 2λ x1 = 0  →  x1(1 − λ) = 0  →  x1 = 0 or λ = 1
∂F/∂x2 = 2 − 2λ x2 = 0
∂F/∂λ = −(x1² + x2² − 1) = 0
If λ = 1 then, from the remaining two partial derivatives, we get x2 = 1 and x1 = 0.
If x1 = 0 then, from the third partial derivative, x2 = ±1. Therefore, the two critical points are (0, 1)
and (0, −1).
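A short SymPy sketch (illustrative, not part of the notes) that solves the three partial-derivative
equations of this example and recovers the two critical points:

import sympy as sp

x1, x2, lam = sp.symbols('x1 x2 lam', real=True)
F = x1**2 + 2*x2 - lam*(x1**2 + x2**2 - 1)          # Lagrangian F(x1, x2, lambda)
equations = [sp.diff(F, v) for v in (x1, x2, lam)]  # the three partial derivatives
print(sp.solve(equations, [x1, x2, lam], dict=True))
# Expected: x1 = 0, x2 = 1 (lambda = 1) and x1 = 0, x2 = -1 (lambda = -1).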
2.4.2 Duality
We consider now the following problem
Minimize f(x)
Subject to:
hi(x) = 0,  i = 1, ..., m
gj(x) ≤ 0,  j = 1, ..., r
We denote by f* the optimal value of the function f and x* the value of the variables leading to
the optimal value.
The Lagrangian of this problem can be written as follows:
L(x, λ, μ) = f(x) + Σ_{i=1}^{m} λi hi(x) + Σ_{j=1}^{r} μj gj(x)
From the Lagrangian, we define:
q(λ, μ) = inf_{x ∈ ℝⁿ} L(x, λ, μ) = inf_{x ∈ ℝⁿ} [ f(x) + Σ_{i=1}^{m} λi hi(x) + Σ_{j=1}^{r} μj gj(x) ]
The function q is called the dual function. The Lagrange multipliers λ and μ are also called dual
variables.
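As a small worked illustration (not in the original notes), consider minimizing f(x) = x² subject to
the single inequality constraint g(x) = 1 − x ≤ 0 (that is, x ≥ 1). The Lagrangian is
L(x, μ) = x² + μ(1 − x). For a fixed μ ≥ 0, the infimum over x is attained at x = μ/2, which gives the
dual function q(μ) = μ − μ²/4. Maximizing q over μ ≥ 0 gives μ* = 2 and q(μ*) = 1, which equals the
optimal value f* = f(1) = 1 of the original problem.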
3 Exercises
1- Consider the function f(X) = −x1² − x2 + x3 defined on Ω = {Y ∈ ℝ³ : y2 = y1/2, y3 = 6 y1/5}.
a. Say if these points are feasible solutions to the problem.
   i.   X^T = (0.25, 0.5, 0.3)
   ii.  X^T = (0.5, 0.25, 0.6)
   iii. X^T = (0.35, 0.175, 0.45)
b. Find a local maximizer.
c. Say if it is global.
Key Solution
a. No, Yes, No
b. x1 = 0.35, x2 = 0.175, x3 = 0.42
c. Yes; after substituting the constraints, the second derivative is always < 0, so the function is
concave (a short verification in Python follows below).
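A short SymPy check of this key (illustrative; it assumes SymPy is available). Substituting the
constraints reduces the problem to a single variable:

import sympy as sp

x1 = sp.symbols('x1', real=True)
# Substitute x2 = x1/2 and x3 = 6*x1/5 into f(X) = -x1^2 - x2 + x3
f = -x1**2 - x1 / 2 + sp.Rational(6, 5) * x1
print(sp.solve(sp.diff(f, x1), x1))   # [7/20], i.e. x1 = 0.35
print(sp.diff(f, x1, 2))              # -2, always < 0, so the local maximizer is global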
Solution
The figure shows the level sets. The gradient and the Hessian are as follows:
∇Z = (∂z/∂x, ∂z/∂y) = (−2x + 4, −2y + 4)
H = [ −2   0
       0  −2 ]  which is negative definite.
Gradient at (4, 1) is (−4, 2). Note this gives us the direction.
NOTE: A 2×2 symmetric matrix A = [ a  b
                                    b  d ] is:
1- positive definite if and only if a > 0 and det(A) > 0
2- negative definite if and only if a < 0 and det(A) > 0
3- indefinite if and only if det(A) < 0
(a small Python check of this rule is sketched below)
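A small Python check of the rule (illustrative only), applied to the Hessian above:

import numpy as np

def classify_2x2(A):
    # Classify a symmetric 2x2 matrix using a = A[0, 0] and det(A), as in the rule above.
    a, det = A[0, 0], np.linalg.det(A)
    if det < 0:
        return "indefinite"
    if a > 0 and det > 0:
        return "positive definite"
    if a < 0 and det > 0:
        return "negative definite"
    return "semidefinite / degenerate case (det = 0)"

print(classify_2x2(np.array([[-2.0, 0.0], [0.0, -2.0]])))   # negative definite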
3- Consider the function Max f(X) = ln(1 + x1 − x2)² − x1² − x2²
a. Determine ∇f, the gradient of f, and ∇²f, the Hessian of f.
b. Starting from X(0) = (−5, −5), determine the best search direction.
c. Using Newton's method, what search direction is used?
d. Perform one iteration of Newton's method to find X(1). By how much does the objective
function increase?
e. Write the expression of X(i) using the gradient method.
f. Starting with X(0) = (−5, −5), what problem should be solved to find the best value
of X(1)?
g. Solve the problem to find X(1) and the increase in the objective function.
Solution
a.
∇f(x) = [ 2/(1 + x1 − x2) − 2 x1,  −2/(1 + x1 − x2) − 2 x2 ]^T

∇²f(x1, x2) = [ −2/(1 + x1 − x2)² − 2        2/(1 + x1 − x2)²
                 2/(1 + x1 − x2)²           −2/(1 + x1 − x2)² − 2 ]
b. The best search direction from (−5, −5) is the direction of the gradient ∇f(−5, −5), i.e. the
direction (12, 8).
c. For Newton, X(i+1) = X(i) − H⁻¹(X(i)) ∇f(X(i)) = X(i) + Newton search direction.
So the search direction for Newton is (5.33, 4.67) (a short numerical check follows below).
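A short NumPy sketch (illustrative, not the official solution) for parts b, c, and d: it evaluates the
gradient and the Hessian at X(0) = (−5, −5), takes one Newton step, and reports the increase in
the objective:

import numpy as np

def f(x):
    x1, x2 = x
    return np.log((1 + x1 - x2) ** 2) - x1 ** 2 - x2 ** 2

def grad_f(x):
    x1, x2 = x
    u = 1 + x1 - x2
    return np.array([2 / u - 2 * x1, -2 / u - 2 * x2])

def hess_f(x):
    x1, x2 = x
    u2 = (1 + x1 - x2) ** 2
    return np.array([[-2 / u2 - 2, 2 / u2], [2 / u2, -2 / u2 - 2]])

x0 = np.array([-5.0, -5.0])
print(grad_f(x0))                                  # (12, 8): steepest-ascent direction (part b)
step = -np.linalg.solve(hess_f(x0), grad_f(x0))    # Newton search direction, ~ (5.33, 4.67) (part c)
x1 = x0 + step                                     # X(1) after one Newton iteration (part d)
print(step, x1, f(x1) - f(x0))                     # increase in the objective function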
4- Use Lagrange multipliers to show that the problem
f = 81 x² + y²   subject to the constraint 4 x² + y² = 9,   x, y ∈ ℝ
has four extreme points that need to be identified.