Nonlinear Programming
Yan-Bin Jia
Nov 1, 2022
1 Introduction
Given a single function f that depends on one or more independent variables, we want to find
the values of those variables where f is maximized or minimized. Often the computational cost is
dominated by the cost of evaluating f (and also perhaps its partial derivatives with respect to all
variables).
Finding a global extremum is, in general, a very difficult problem. Two standard heuristics are
widely used: i) find local extrema starting from widely varying values of the independent variables,
and then pick the most extreme of these; ii) perturb a local extremum by taking a finite amplitude
step away from it, and then see if your routine can get to a better point, or “always” to the same
one. Recently, “simulated annealing” methods have demonstrated important successes on a variety
of global optimization problems.
The diagram below describes a function in an interval [a, b]. The derivative vanishes at the
points B, C, D, E, F . The points B and D are local but not global maxima. The points C and E
are local but not global minima. The global maximum occurs at F where the derivative vanishes.
The global minimum occurs at the left endpoint A of the interval so that the derivative need not
vanish.
[Figure: a function on the interval [a, b]; the derivative vanishes at B, C, D, E, F; the global maximum is at F, and the global minimum is at the endpoint A.]
Recall how the optimization process works in one dimension when f is a function from R to R.
First we might compute the critical set of f,
$$c_f = \{\, x \mid f'(x) = 0 \,\}.$$
By examining this set we can determine those x that are global minima or maxima. Notice that the computation of c_f seems to entail finding the zeros of the derivative f′. In other words, we have reduced the optimization problem to the root-finding problem. So why do we need to study nonlinear programming? In higher dimensions, it is often easier to find a (local) minimum than one would expect. Intuitively, this is because f′ is not an arbitrary function but a derivative whose integral (namely f) is given. We will thus have a lot more to say about optimization in higher dimensions than we did about root finding.
Since a maximization problem can be turned into a minimization problem simply by negating
the objective function, we will deal with minimization only from now on.
2 Golden Section Search
Suppose a continuous function f attains a minimum inside an interval, bracketed by a triple of points a < b < c with f(b) < f(a) and f(b) < f(c). Now choose a point x, say, halfway between a and b. If f(x) > f(b), as shown in the figure below, then the new bracketing triple becomes [x, b, c]. If f(x) < f(b), then the new bracketing triple becomes [a, x, b]. To ensure that the interval [a, c] shrinks toward a point, one should alternate the subinterval being halved, at least after a few rounds: for instance, halve [a, b] in this round and [b, c] in the next round.
[Figure: a bracketing triple a < b < c with a trial point x between a and b.]
The figure on the next page shows a more complete example of how golden section search works.
The minimum is originally bracketed by points 1, 3, 2. The function is evaluated at 4, which
replaces 2; then at 5, which replaces 1; and then at 6, which replaces 4. Note that the center point
is always lower than the two outside points. The minimum is bracketed by points 5, 3, 6 after the
three steps.
[Figure: three steps of golden section search; the bracket evolves from (1, 3, 2) to (1, 3, 4), then (5, 3, 4), then (5, 3, 6).]
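For concreteness, below is a minimal Python sketch of this bracket-shrinking search. Placing the trial point a golden-ratio fraction into the larger subinterval is the standard choice; the tolerance and the example function are illustrative.

```python
import math

def golden_section(f, a, b, c, tol=1e-8):
    """Minimize f over a bracketing triple a < b < c with
    f(b) < f(a) and f(b) < f(c).  Returns the final abscissa b."""
    r = (math.sqrt(5.0) - 1.0) / 2.0      # inverse golden ratio, ~0.618
    while c - a > tol:
        if b - a > c - b:                  # [a, b] is the larger subinterval
            x = b - (1.0 - r) * (b - a)    # trial point inside [a, b]
            if f(x) < f(b):
                b, c = x, b                # new triple [a, x, b]
            else:
                a = x                      # new triple [x, b, c]
        else:                              # [b, c] is the larger subinterval
            x = b + (1.0 - r) * (c - b)    # trial point inside [b, c]
            if f(x) < f(b):
                a, b = b, x                # new triple [b, x, c]
            else:
                c = x                      # new triple [a, b, x]
    return b

# Example: minimum of (x - 2)^2, bracketed by 0 < 1 < 5.
print(golden_section(lambda x: (x - 2.0) ** 2, 0.0, 1.0, 5.0))  # ~2.0
```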
Golden section search applies to extremizing functions of one variable only, just as bisection applies only to finding roots of such functions. Now we consider functions of more than one variable. Let the function f : R^n → R be twice continuously differentiable, that is, f ∈ C². A point x∗ ∈ R^n is said to be a relative minimum point or a local minimum point if there is an ε > 0 such that f(x) ≥ f(x∗) for all x ∈ R^n with ‖x − x∗‖ < ε. If f(x) > f(x∗) for all x ≠ x∗ with ‖x − x∗‖ < ε, then x∗ is said to be a strict relative minimum point of f.
A point x∗ is said to be a global minimum point of f if f(x) ≥ f(x∗) for all x. It is said to be a strict global minimum point if f(x) > f(x∗) for all x ≠ x∗.
Let x = (x_1, x_2, ..., x_n)^T. Recall that the gradient of f is the vector
$$\nabla f(x) = \left( \frac{\partial f}{\partial x_1}, \frac{\partial f}{\partial x_2}, \cdots, \frac{\partial f}{\partial x_n} \right). \qquad (1)$$
It gives the direction in which the value of f increases the fastest. The Hessian H of f is defined
as an n × n matrix:
$$H(x) = \begin{pmatrix}
\frac{\partial^2 f}{\partial x_1^2} & \frac{\partial^2 f}{\partial x_1 \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_1 \partial x_n} \\
\frac{\partial^2 f}{\partial x_2 \partial x_1} & \frac{\partial^2 f}{\partial x_2^2} & \cdots & \frac{\partial^2 f}{\partial x_2 \partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial^2 f}{\partial x_n \partial x_1} & \frac{\partial^2 f}{\partial x_n \partial x_2} & \cdots & \frac{\partial^2 f}{\partial x_n^2}
\end{pmatrix}.$$
If x∗ is a relative minimum point of f, then the following first- and second-order necessary conditions hold:
i) ∇f(x∗) = 0,
ii) d^T H(x∗) d ≥ 0 for every d ∈ R^n.
In one dimension, the above necessary conditions are familiar to us: f′(x∗) = 0 and f″(x∗) ≥ 0 at a relative minimum x∗.
[Figure: a one-dimensional function with a relative minimum at x∗.]
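These conditions can be checked numerically at a candidate point. The following sketch approximates the gradient and Hessian by central differences; the helper functions, step sizes, and the test function are illustrative, not part of the text.

```python
import numpy as np

def gradient(f, x, h=1e-5):
    """Central-difference approximation of the gradient of f at x."""
    g = np.zeros_like(x)
    for i in range(len(x)):
        e = np.zeros_like(x)
        e[i] = h
        g[i] = (f(x + e) - f(x - e)) / (2.0 * h)
    return g

def hessian(f, x, h=1e-4):
    """Central-difference approximation of the Hessian of f at x."""
    n = len(x)
    H = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            ei = np.zeros(n); ei[i] = h
            ej = np.zeros(n); ej[j] = h
            H[i, j] = (f(x + ei + ej) - f(x + ei - ej)
                       - f(x - ei + ej) + f(x - ei - ej)) / (4.0 * h * h)
    return H

# Candidate minimum of f(x, y) = x^2 + 3y^2 at the origin.
f = lambda x: x[0] ** 2 + 3.0 * x[1] ** 2
x_star = np.array([0.0, 0.0])
print(gradient(f, x_star))                     # ~(0, 0): condition i)
print(np.linalg.eigvalsh(hessian(f, x_star)))  # ~(2, 6), all >= 0: condition ii)
```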
The Hessian H(x∗) at a relative minimum x∗ is symmetric positive semi-definite, that is, x^T H(x∗) x ≥ 0 for any x. Conversely, if ∇f(x∗) = 0 and H(x∗) is positive definite, that is, x^T H(x∗) x > 0 for any x ≠ 0, then x∗ is a strict relative minimum. This gives us sufficient conditions for a relative minimum.
Consider, for example, the quadratic function f(x) = c + b^T x + ½ x^T A x, where A is a symmetric positive definite n × n matrix. Then
$$\nabla f(x) = b^T + x^T A, \qquad H(x) = A.$$
So there is a single extremum, located at x∗, which is the solution of the system Ax = −b. Since A is positive definite, this extremum is a strict local minimum. Since it is the only one, it is in fact the global minimum.
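In code, minimizing such a quadratic therefore reduces to a single linear solve. A small numpy sketch, with illustrative values of A and b:

```python
import numpy as np

# f(x) = c + b^T x + (1/2) x^T A x with A symmetric positive definite.
A = np.array([[4.0, 1.0],
              [1.0, 3.0]])        # illustrative SPD matrix
b = np.array([1.0, 2.0])          # illustrative vector

x_star = np.linalg.solve(A, -b)   # the unique minimizer solves A x = -b
print(x_star)
print(b + A @ x_star)             # gradient at x_star: ~(0, 0)
```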
If we neglect the higher order terms inside the Big-O in the Taylor series (2), every function in
one dimension behaves like this (locally near a minimum):
$$y = c + bx + \frac{1}{2} a x^2, \qquad y' = b + ax, \qquad y'' = a > 0.$$
[Figure: the parabola y = c + bx + ax²/2, attaining its minimum value c − b²/(2a) at x = −b/a.]
3 Convex Function
We just saw that H is positive definite at and near a strict local minimum. By Taylor’s theorem
every function looks like a quadratic near a strict local minimum. Furthermore, if f happens to
be quadratic globally, formed from a symmetric positive definite matrix A as in (3), then it has a
unique local minimum. This local minimum is therefore a global minimum.
Can we say more about functions whose local minima are global minima? A broad class of such
functions are the convex functions.
A function f : Ω → R defined on a convex domain Ω is said to be convex if for every pair of
points x1 , x2 ∈ Ω and any α with 0 ≤ α ≤ 1, the following holds:
$$f\big(\alpha x_1 + (1 - \alpha) x_2\big) \le \alpha f(x_1) + (1 - \alpha) f(x_2).$$
[Figure: a convex function over [a, b]; the chord from (x₁, f(x₁)) to (x₂, f(x₂)) lies on or above the graph.]
Convexity of a smooth function can be characterized in terms of its Hessian, as stated in the following proposition.
Proposition 1 Let f ∈ C². Then f is convex over a convex set Ω containing an interior point if and only if the Hessian matrix H is positive semi-definite in Ω.
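Proposition 1 suggests a practical numerical test: sample points of Ω and verify that the smallest eigenvalue of the Hessian is nonnegative at each. A sketch, assuming the Hessian can be evaluated exactly (the example function is illustrative):

```python
import numpy as np

def psd_on_samples(hess, samples, tol=1e-9):
    """True if hess(x) is positive semi-definite at every sampled x."""
    return all(np.linalg.eigvalsh(hess(x)).min() >= -tol for x in samples)

# f(x, y) = x^4 + y^2 has Hessian diag(12 x^2, 2), PSD everywhere.
hess = lambda x: np.diag([12.0 * x[0] ** 2, 2.0])
pts = [np.array(p, dtype=float) for p in [(-1, 0), (0, 0), (2, 3)]]
print(psd_on_samples(hess, pts))   # True
```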
The minima of convex functions are global minima, as shown by the following theorem.
5
Theorem 2 Let f be a convex function defined on a convex set Ω. Then the set Γ where f achieves
its minimum value is convex. Furthermore, any relative minimum is a global minimum.
Proof If f has no relative minima then the theorem is valid by default. Assume therefore that c₀ is the minimum value of f on Ω. Define the set
Γ = { x ∈ Ω | f(x) = c₀ }.
Suppose x₁, x₂ ∈ Γ. For any α with 0 ≤ α ≤ 1, convexity gives
$$f\big(\alpha x_1 + (1 - \alpha) x_2\big) \le \alpha f(x_1) + (1 - \alpha) f(x_2) = c_0,$$
and equality must hold since c₀ is the minimum value of f on Ω. In other words, f is also minimized at the point αx₁ + (1 − α)x₂. Thus all the points on the line segment connecting x₁ and x₂ are in Γ. Since x₁ and x₂ are arbitrarily chosen from Γ, the set must be convex.
Suppose now that x∗ ∈ Ω is a relative minimum point of f but not a global minimum. Then there exists some y ∈ Ω such that f(y) < f(x∗). On the line segment { αy + (1 − α)x∗ | 0 ≤ α ≤ 1 } we have, for 0 < α ≤ 1,
$$\begin{aligned}
f\big(\alpha y + (1 - \alpha) x^*\big) &\le \alpha f(y) + (1 - \alpha) f(x^*) \\
&< \alpha f(x^*) + (1 - \alpha) f(x^*) \\
&= f(x^*),
\end{aligned}$$
so points arbitrarily close to x∗ (small α) have strictly smaller function values, contradicting that x∗ is a relative minimum point.
4 Steepest Descent
Now let us return to the general problem of minimizing a function f : Rn → R. We want to find
the critical points where the gradient ∇f = 0. This is a system of n equations:
$$\frac{\partial f}{\partial x_1} = 0, \quad \ldots, \quad \frac{\partial f}{\partial x_n} = 0.$$
We might expect to encounter the usual difficulties associated with higher-dimensional root finding.
Fortunately, the derivative nature of the equations imposes some helpful structure on the problem.
In one dimension, to find a local minimum, we might employ the following rule: move to the left if f′(x) > 0 and to the right if f′(x) < 0, that is, step in the direction of −f′(x). In higher dimensions we use the negative gradient −∇f to point us toward a minimum. This is called steepest descent. In particular, the algorithm repeatedly performs one-dimensional minimizations along the direction of steepest descent.
In the algorithm, we start with x(0) as an approximation to a local minimum of f : R^n → R. At the m-th iteration we set u = ∇f(x(m)) and minimize the one-dimensional function
$$g(t) = f\big(x^{(m)} - t\,u\big)$$
over t ≥ 0; if t∗ is the minimizer, the next iterate is x(m+1) = x(m) − t∗ u. The method is also referred to as a line search strategy since during each iteration it moves on the line x(m) − tu away from x(m) until encountering a local minimum of g(t). How should we carry out the line minimization of g(t)? Any way you want. For instance, solve g′(t) = 0 directly. Or step along the line until you produce a bracket, and then refine it (e.g., by golden section search).
Consider, as an example, minimizing f(x₁, x₂) = x₁³ + x₂³ − 2x₁² + 3x₂² − 8 starting at x(0) = (1, −1)^T, where ∇f = (3x₁² − 4x₁, 3x₂² + 6x₂) evaluates to (−1, −3). Thus, in the first step of steepest descent, we look for a minimum of the function
$$\begin{aligned}
g(t) &= f\big(x^{(0)} - t \nabla f(x^{(0)})\big) \\
&= f(1 + t, -1 + 3t) \\
&= (1 + t)^3 + (-1 + 3t)^3 - 2(1 + t)^2 + 3(-1 + 3t)^2 - 8.
\end{aligned}$$
Solving g′(t) = 0 yields t = 1/3, which gives x(1) = (4/3, 0)^T. The gradient ∇f vanishes at x(1). Therefore f achieves at least a local minimum at (4/3, 0)^T.
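Here is a minimal Python sketch of steepest descent with a one-dimensional line search, applied to this example. The use of scipy's bounded scalar minimizer and the search interval (0, 10) are illustrative choices, not part of the method.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def steepest_descent(f, grad, x0, tol=1e-8, max_iter=100):
    """Repeated line minimization along the negative gradient."""
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        u = grad(x)
        if np.linalg.norm(u) < tol:      # gradient vanishes: stop
            break
        g = lambda t: f(x - t * u)       # f restricted to the ray
        t_star = minimize_scalar(g, bounds=(0.0, 10.0),
                                 method='bounded').x
        x = x - t_star * u
    return x

# The example above: f(x1, x2) = x1^3 + x2^3 - 2 x1^2 + 3 x2^2 - 8.
f = lambda x: x[0]**3 + x[1]**3 - 2.0*x[0]**2 + 3.0*x[1]**2 - 8.0
grad = lambda x: np.array([3.0*x[0]**2 - 4.0*x[0],
                           3.0*x[1]**2 + 6.0*x[1]])
print(steepest_descent(f, grad, [1.0, -1.0]))   # ~(4/3, 0)
```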
It turns out that steepest descent converges globally to a relative minimum, and the convergence rate is linear. Let A and a be the largest and smallest eigenvalues, respectively, of the Hessian H at the local minimum. Then the following holds for the ratio between the errors at two adjacent steps:
$$\frac{|e_{m+1}|}{|e_m|} \sim \left( \frac{A - a}{A + a} \right)^2.$$
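For the earlier example f(x₁, x₂) = x₁³ + x₂³ − 2x₁² + 3x₂² − 8, the Hessian is diag(6x₁ − 4, 6x₂ + 6), so at the minimum H(4/3, 0) = diag(4, 6). With A = 6 and a = 4, the ratio is ((6 − 4)/(6 + 4))² = 1/25, so each step shrinks the error by roughly a factor of 25; convergence is fast here because the two eigenvalues are close. When A ≫ a (a long, narrow valley), the ratio approaches 1 and progress becomes very slow.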
The steepest descent method can take many steps. The problem is that the method repeatedly moves in the current steepest direction until reaching a minimum along that direction. Consequently, consecutive steps are perpendicular to each other (this behavior is illustrated in the figure below). To see why, consider moving from x(k) along u = −∇f(x(k)) to reach x(k+1) = x(k) + t∗u, where f no longer decreases. The rate at which the value of f changes, after a movement of tu from x(k), is measured by the directional derivative ∇f(x(k) + tu) · u. This derivative is negative at x(k) (when t = 0) and does not change sign before x(k+1) (when t = t∗). Suppose ∇f(x(k+1)) were not perpendicular to ∇f(x(k)). Then ∇f(x(k+1)) · u < 0 would have to hold, which means that the value of f would decrease further if we continued moving in the direction u past x(k+1), contradicting the choice of t∗.
So there are a number of back and forth steps that only slowly converge to a minimum. This
situation can get very bad in a narrow valley, where successive steps undo some of their previous
progress. Ideally, in Rn we would like to take n perpendicular steps, each of which attains a
minimum. This idea will lead to the conjugate gradient method.
[Figure: contour lines f(x) = c₁, c₂, c₃, c₄ of a function with a narrow valley; from the starting point, successive steepest descent steps zigzag, each perpendicular to the previous one.]
A Matrix Calculus
This appendix presents some basic rules of differentiations of scalars and vectors with respect to
vectors and matrices. These rules will be used later on in the course.
A.1 Differentiation With Respect to a Vector
The derivative of a vector function f (x) = (f1 (x), f2 (x), . . . , fm (x))T with respect to x is an m × n
matrix:
$$\frac{\partial f}{\partial x} = \begin{pmatrix}
\frac{\partial f_1}{\partial x_1} & \frac{\partial f_1}{\partial x_2} & \cdots & \frac{\partial f_1}{\partial x_n} \\
\frac{\partial f_2}{\partial x_1} & \frac{\partial f_2}{\partial x_2} & \cdots & \frac{\partial f_2}{\partial x_n} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{\partial f_m}{\partial x_1} & \frac{\partial f_m}{\partial x_2} & \cdots & \frac{\partial f_m}{\partial x_n}
\end{pmatrix}. \qquad (4)$$
For a constant vector c and a constant matrix A, it follows that
$$\frac{\partial (c^T x)}{\partial x} = c^T, \qquad (5)$$
$$\frac{\partial (Ax)}{\partial x} = A. \qquad (6)$$
For two vectors u and v,
$$\frac{\partial (u \cdot v)}{\partial v} = \frac{\partial (u^T v)}{\partial v} = u^T, \qquad
\frac{\partial (u \cdot v)}{\partial u} = \frac{\partial (v^T u)}{\partial u} = v^T.$$
To differentiate the cross product u × v, for any vector w = (w1 , w2 , w3 )T we denote by w× the
following 3 × 3 anti-symmetric matrix:
$$w^\times = \begin{pmatrix} 0 & -w_3 & w_2 \\ w_3 & 0 & -w_1 \\ -w_2 & w_1 & 0 \end{pmatrix}.$$
By construction, the product of the matrix u× with v is the cross product u × v. It then follows that
$$\frac{\partial (u \times v)}{\partial v} = u^\times, \qquad
\frac{\partial (u \times v)}{\partial u} = -\frac{\partial (v \times u)}{\partial u} = -v^\times.$$
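A quick numerical check of the identity (u×)v = u × v; the vectors are arbitrary:

```python
import numpy as np

def skew(w):
    """The anti-symmetric cross-product matrix w x of w."""
    return np.array([[ 0.0,  -w[2],  w[1]],
                     [ w[2],  0.0,  -w[0]],
                     [-w[1],  w[0],  0.0]])

u = np.array([1.0, 2.0, 3.0])
v = np.array([4.0, 5.0, 6.0])
print(skew(u) @ v)        # (-3, 6, -3)
print(np.cross(u, v))     # identical
```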
Now let us look at how to differentiate the scalar xT Ax, where x is an n-vector and A an n × n
matrix, with respect to x. We have
$$\begin{aligned}
\frac{\partial (x^T A x)}{\partial x}
&= \left.\frac{\partial}{\partial x}\, x^T (A y)\right|_{y=x} + \left.\frac{\partial}{\partial x}\, (y^T A) x\right|_{y=x} \\
&= \left.\frac{\partial}{\partial x}\, (A y)^T x\right|_{y=x} + y^T A\big|_{y=x} \\
&= (A y)^T\big|_{y=x} + x^T A \\
&= x^T A^T + x^T A.
\end{aligned}$$
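This identity is easy to verify numerically by comparing the formula x^T A^T + x^T A against central differences; the matrix and vector below are arbitrary:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))   # arbitrary, not necessarily symmetric
x = rng.standard_normal(3)

f = lambda z: z @ A @ z           # the scalar z^T A z
h = 1e-6
numeric = np.array([(f(x + h * e) - f(x - h * e)) / (2.0 * h)
                    for e in np.eye(3)])
print(numeric)                    # central-difference gradient
print(x @ (A.T + A))              # formula: x^T A^T + x^T A
```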
Let C = (cij )m×n and X = (xij )m×n be two matrices. The trace of the product matrix CX T is
the sum of its diagonal entries:
$$\mathrm{Tr}(C X^T) = \sum_{r=1}^{m} \sum_{s=1}^{n} c_{rs} x_{rs}.$$
Immediately, we have
$$\frac{\partial}{\partial x_{ij}} \mathrm{Tr}(C X^T) = c_{ij},$$
which implies that
$$\frac{\partial}{\partial X} \mathrm{Tr}(C X^T) = C.$$
Next, we differentiate the trace of the product matrix XCX^T as follows:
$$\begin{aligned}
\frac{\partial}{\partial X} \mathrm{Tr}(X C X^T)
&= \left.\frac{\partial}{\partial X} \mathrm{Tr}(X C Y^T)\right|_{Y=X} + \left.\frac{\partial}{\partial X} \mathrm{Tr}\big((Y C) X^T\big)\right|_{Y=X} \\
&= \left.\frac{\partial}{\partial X} \mathrm{Tr}\big((Y C^T) X^T\big)\right|_{Y=X} + Y C\big|_{Y=X} \\
&= Y C^T\big|_{Y=X} + X C \\
&= X C^T + X C.
\end{aligned}$$
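Both trace identities can be verified the same way; a sketch with arbitrary small matrices:

```python
import numpy as np

rng = np.random.default_rng(1)
C = rng.standard_normal((2, 3))
D = rng.standard_normal((3, 3))
X = rng.standard_normal((2, 3))
h = 1e-6

def num_grad(fun, X):
    """Entrywise central-difference derivative of a scalar fun(X)."""
    G = np.zeros_like(X)
    for idx in np.ndindex(X.shape):
        E = np.zeros_like(X)
        E[idx] = h
        G[idx] = (fun(X + E) - fun(X - E)) / (2.0 * h)
    return G

print(np.allclose(num_grad(lambda Z: np.trace(C @ Z.T), X), C))
print(np.allclose(num_grad(lambda Z: np.trace(Z @ D @ Z.T), X),
                  X @ D.T + X @ D))
```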
Let A(t) = (a_{ij}(t)) be a matrix whose entries are functions of a scalar t. Differentiation and integration of the matrix operate element-wise:
$$\dot{A}(t) = \big(\dot{a}_{ij}(t)\big), \qquad \int A(t)\, dt = \left( \int a_{ij}(t)\, dt \right).$$
Suppose A(t) is n × n and non-singular. Then we have AA−1 = In , the n × n identity matrix.
Thus,
$$0 = \frac{d}{dt}(A A^{-1}) = \dot{A} A^{-1} + A \frac{d}{dt}(A^{-1}),$$
which yields the derivative of the inverse matrix:
$$\frac{d}{dt}(A^{-1}) = -A^{-1} \dot{A} A^{-1}. \qquad (8)$$
An interesting case is with the rotation matrix R, which is also orthogonal, i.e., RR^T = R^T R = I_n. Differentiating RR^T = I_n, we obtain
$$0 = \frac{d}{dt}(R R^T) = \dot{R} R^T + R \dot{R}^T = \dot{R} R^T + (\dot{R} R^T)^T.$$
The above implies that the matrix ṘR^T is anti-symmetric. Therefore, it can be written as
$$\dot{R} R^T = \begin{pmatrix} 0 & -\omega_z & \omega_y \\ \omega_z & 0 & -\omega_x \\ -\omega_y & \omega_x & 0 \end{pmatrix}.$$
The vector ω = (ω_x, ω_y, ω_z)^T is the angular velocity; the cross product ω × v = ṘR^T v describes the rate of change of the vector v (i.e., the velocity of the endpoint of the vector) due to the body rotation described by R.
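As a concrete check, take R(t) to be a rotation by angle θ(t) about the z-axis:
$$R = \begin{pmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{pmatrix}, \qquad
\dot{R} R^T = \dot{\theta} \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix},$$
so ω = (0, 0, θ̇)^T: the angular velocity points along the rotation axis with magnitude θ̇, as expected.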
Using the Taylor expansion, we define the matrix exponential function
$$e^{At} = \sum_{j=0}^{\infty} \frac{(At)^j}{j!},$$
where A is an n × n matrix. The function's importance comes from its role in the solution of the linear system ẋ = Ax + bu, where u is the control vector. It has the derivative
$$\frac{d}{dt} e^{At} = A e^{At} = e^{At} A.$$
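More explicitly (a standard fact about linear systems), with initial state x(0) the system ẋ = Ax + bu has the solution
$$x(t) = e^{At} x(0) + \int_0^t e^{A(t - \tau)}\, b\, u(\tau)\, d\tau,$$
as can be verified by differentiating the right-hand side and applying the derivative formula above.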