0% found this document useful (0 votes)

151 views4 pages

Assign1 APM462 S2016

This document contains solutions to three questions regarding nonlinear optimization and function approximation. Question 1 involves finding the point satisfying the first order conditions and proving it is a global minimum for the function f(x,y)=2x^2 + y^2 + xy - y. Question 2 involves finding all local minimum points for the function f(x,y,z)=2x^2 + xy + y^2 + yz + z^2 - 6x - 8y - 8z + 9 and proving it is a global minimum. Question 3 involves approximating a parabolic function g(x)=x^2 by a linear polynomial p_a(x)=a_0 + a

Uploaded by

Wendy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

151 views4 pages

Assign1 APM462 S2016

Uploaded by

Wendy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Assignment 1

APM462 – Nonlinear Optimization – Summer 2016

Christopher J. Adkins

Solutions

Question 1 Let f (x, y) = 2x2 + y 2 + xy − y

(a) Find a point satisfying the first order conditions for f

(b) Prove that the point you found in a) is a global minimum for f

Solution Notice we may rewrite f as

1
f (x) = (x, Qx) − (b, x)
2
where (notation def: (x1 , x2 ) := xT1 x2 )
! !
4 1 0
Q= & b=
1 2 1

(a) Now it’s easy to see the first order condition (∇f = 0) gives

∇f (x∗ ) = Qx∗ − b = 0 =⇒ x∗ = Q−1 b

One may now easily compute x∗ :

! !
−1 1 2 −1 −1/7
Q = =⇒ x∗ =
7 −1 4 4/7

(b) We compute the eigenvalues of Q using P (λ) = det(Q − 1λ) = λ2 − 6λ + 7. Finding the roots gives the
two eigenvalues as
√
λ± = 3 ± 2

i.e. λ+ > λ− > 0 which means Q is positive definite. Now if we “complete the square” on f , we see
1 1 1
f (x) = (x − x∗ , Q(x − x∗ )) − (x∗ , Qx∗ ) > − (x∗ , Qx∗ )
2 2 2
Thus x∗ is a global minimum for f .

Question 2 Find all local minimum points for the function

f (x, y, z) = 2x2 + xy + y 2 + yz + z 2 − 6x − 8y − 8z + 9

Prove that your solution really is a global minimum.

1
Assignment 1 – Spring 2016 - CJA APM462

Solution Notice we may rewrite f as

1
f (x) = (x, Qx) − (b, x) + 9
2
where    
4 1 0 6
   
Q = 1
 2 1
 & b = 8


0 1 2 8
−1
One may easily compute x∗ using x∗ = Q b as we saw in the previous question. Plug and Chug
   
3 −2 1 1
1 
Q−1
  
= −2 8 −4 =⇒ x∗ = 2
 
10  
1 −4 7 3
Now we show Q is positive definite by checking the eigenvalues. We see the characteristic equation is given by

P (λ) = det(Q − 1λ) = −λ3 + 8λ2 − 18λ + 10

and the obvious bound of

P (λ) > 10 when λ60
shows that all eigenvalues are positive(since Q is symmetric, the eigenvalues must be real, and we’ve shown
there are no non-positive eigenvalues), hence x∗ is a global minimum by completing the square as we saw in the
previous question. Another method of of checking if Q is positive definite (could use this for question 1 as well)
is through Sylvester’s Criterion. This states that a symmetric matrix M is positive definite if and only if the
following matrices have positive determinate: the upper left 1-by-1 corner of M , the upper left 2-by-2 corner of
M , . . . , M itself. In this case its easy to see that
 
! 4 1 0
4 1  
det(4) =⇒ 4 & det =7 & 1 2  = 10
1
1 2 
0 1 2
Thus Sylvester’s Criterion gives us Q is positive definite.

Question 3 To approximate a function g : [0, 1] → R by an n-th order polynomial, one can minimize the
function f defined by Z 1
f (a) = (g(x) − pa (x))2 dx
0
where, for a = (a0 , . . . , an ) ∈ Rn+1 , we use the notation

pa (x) = a0 + a1 x + . . . + an xn = (x, a) & x = (1, x, . . . , xn )

In this question we will investigate approximating the parabola g(x) = x2 by the linear polynomials pa (x) =
a0 + a1 x.

(a) Show that f (a) can be written in the form

f (a) = aT Qa − 2bT a + c

for a 2 × 2 matrix Q, a vector b ∈ R2 and a number c. Find formulas for Q, b and c. It should be clear
from your formula that Q is symmetric.

(b) Find the first-order necessary condition for a point a∗ ∈ R2 to be a minimum point for f .

2
Assignment 1 – Spring 2016 - CJA APM462

Solution

(a) Expanding out f (a), we see that

Z 1 Z 1
2
f (a) = (g(x) − pa (x)) dx = (g(x) − (x, a))2 dx
0 0
Z 1
(x, a)(x, a) − 2g(x)(x, a) + g(x)2 dx

=
0
Z 1
(a, xxT a) − 2(g(x)x, a) + g(x)2 dx

=
0

=(a, Qa) − 2(b, a) + c

where Z 1 Z 1
1
Q= xxT dx =⇒ Qij = xi−1 xj−1 dx =
0 0 i+j−1
Z 1 Z 1
b= g(x)xdx =⇒ bi = g(x)xi−1 dx
0 0
Z 1
c= g(x)2 dx
0

Note that the particular form of Q implies that it is positive definite (since (v, xxT v) = (v, x)2 > 0 and
det Q 6= 0). If g(x) = x2 and x = (1, x), we see
!
1 1/2
Q=
1/2 1/3
! !
1
x2 1/3
Z
b= dx =
0 x3 1/4
Z 1
c= x4 dx = 1/5
0

(b) As usual with functions of this form, we know that critical point a∗ must satisfy Qa∗ = b, and since Q is
positive definite we have that a∗ = Q−1 b.

Question 4 Assume that g is a convex function on Rn , that f is a convex function of a single variable, and
in addition that f is a nondecreasing function (which means that f (r) > f (s) whenever r > s).

(a) Show that F (x) := f ◦ g(x) = f (g(x)) is convex by directly verifying the convexity inequality

F (tx1 + (1 − t)x2 ) 6 tF (x1 ) + (1 − t)F (x2 )

explain where each hypothesis (convexity of g, convexity of f , and the fact that f is nondecreasing) is
used in your reasoning.

(b) Now assume that f and g are both C 2 . Express the matrix of second derivatives ∇2 F (x) in terms of f
and g. Prove directly (without using part a)) that ∇2 F (x) is positive semidefinite at every x.

3
Assignment 1 – Spring 2016 - CJA APM462

Solution

(a) By direct computation, we see

F (tx1 + (1 − t)x2 ) =f (g(tx1 + (1 − t)x2 )) definition of F

6f (tg(x1 ) + (1 − t)g(x2 )) g is convex and f nondecreasing
6tf (g(x1 )) + (1 − t)f (g(x2 )) f is convex
=tF (x1 ) + (1 − t)F (x2 ) defintion of F

(b) First compute the gradient, we see that

∇F (x) = f 0 (g(x))∇g

Computing the matrix of second derivatives now shows we have (by product rule)

∇2 F (x) = f 00 (g(x))∇g∇g T + f 0 (g(x))∇2 g

Since f is nondecreasing and convex, we have that f 0 > 0 and f 00 > 0 at every x. Since g is convex, we
have that ∇2 g is positive semidefinite at every x. As we’ve mentioned before, matrices of the form xxT
are positive semidefinite(in this case we have ∇g∇g T ). Thus ∇2 F is positive semidefinite at every x, i.e.

(y, ∇2 F (x) y) = f 00 (g(x))(y, ∇g∇g T (x) y) + f 0 (g(x))(y, ∇2 g(x) y) > 0 ∀y ∈ Rn

Question 5 Prove that if f1 and f2 are two convex functions on Rn , then

g(x) := max{f1 (x), f2 (x)}

is also convex.

Solution This is easy to verify directly:

g(tx1 + (1 − t)x2 ) = max{f1 (tx1 + (1 − t)x2 ), f2 (tx1 + (1 − t)x2 )}

6 max{tf1 (x1 ) + (1 − t)f1 (x2 ), tf2 (x1 ) + (1 − t)f2 (x2 )} f1 and f2 are convex.
6t max{f1 (x1 ), f2 (x1 )} + (1 − t) max{f1 (x2 ), f2 (x2 )} bound by the bigger function at x1 and x2
=tg(x1 ) + (1 − t)g(x2 )

Quests On: Ereq Le: Tly y Asked
No ratings yet
Quests On: Ereq Le: Tly y Asked
280 pages
Assign1 APM462 S2016
No ratings yet
Assign1 APM462 S2016
4 pages
Sylvester's Criterion
No ratings yet
Sylvester's Criterion
4 pages
Athena Scientific - Introduction To Linear Optimization - Bertsimas - Dimitris (1997)
100% (1)
Athena Scientific - Introduction To Linear Optimization - Bertsimas - Dimitris (1997)
186 pages
Stable Convergence and Stable Limit Theorems: Erich Häusler Harald Luschgy
No ratings yet
Stable Convergence and Stable Limit Theorems: Erich Häusler Harald Luschgy
231 pages
(Dimitris N. Politis, Joseph P. Romano, Michael Subsampling
No ratings yet
(Dimitris N. Politis, Joseph P. Romano, Michael Subsampling
180 pages
F07HW8 Taylor 11.14
No ratings yet
F07HW8 Taylor 11.14
7 pages
STA457
No ratings yet
STA457
30 pages
150iqs-Second Ed-Fifteen Questions Solutions
No ratings yet
150iqs-Second Ed-Fifteen Questions Solutions
30 pages
04-05 - SQL-2up csc343
No ratings yet
04-05 - SQL-2up csc343
38 pages
Week 10: The Entity-Relationship Model
No ratings yet
Week 10: The Entity-Relationship Model
15 pages
Tutorial 6 - XML DTD
No ratings yet
Tutorial 6 - XML DTD
9 pages
02b - SQL-DDL - CSC 343
No ratings yet
02b - SQL-DDL - CSC 343
6 pages
Lab4-PL - SQL 000
0% (1)
Lab4-PL - SQL 000
9 pages
Lecture Slides For Introduction To Applied Linear Algebra: Vectors, Matrices, and Least Squares
No ratings yet
Lecture Slides For Introduction To Applied Linear Algebra: Vectors, Matrices, and Least Squares
470 pages
Functional Dependencies
100% (1)
Functional Dependencies
73 pages
FDs Solutions
No ratings yet
FDs Solutions
2 pages
A 3 Solution
No ratings yet
A 3 Solution
18 pages
10 DBDesignStG-4up
No ratings yet
10 DBDesignStG-4up
12 pages
Functional Analysis I PDF
100% (2)
Functional Analysis I PDF
286 pages
Database Application Development: CSC343 - Introduction To Databases - A. Vaisman 1
No ratings yet
Database Application Development: CSC343 - Introduction To Databases - A. Vaisman 1
21 pages
(J.W. Gardner, R. Wiegandt) Radical Theory of Ring
100% (2)
(J.W. Gardner, R. Wiegandt) Radical Theory of Ring
408 pages
t3 Simple SQL
No ratings yet
t3 Simple SQL
5 pages
t3 Simple SQL
No ratings yet
t3 Simple SQL
5 pages
Assignment 1: Learning Goals
No ratings yet
Assignment 1: Learning Goals
5 pages
99 Ab
No ratings yet
99 Ab
12 pages
Eigenvalues and Eigenvectors
No ratings yet
Eigenvalues and Eigenvectors
15 pages
Assignment 2
No ratings yet
Assignment 2
4 pages
Ziemer, Modern Real Analysis
No ratings yet
Ziemer, Modern Real Analysis
408 pages
STA457 Project
No ratings yet
STA457 Project
6 pages
03 RAlgebra PDF
No ratings yet
03 RAlgebra PDF
15 pages
Banach Spaces
100% (2)
Banach Spaces
34 pages
Harmonic Analysis
No ratings yet
Harmonic Analysis
84 pages
Multilinear Algebra - MIT
No ratings yet
Multilinear Algebra - MIT
141 pages
Metric Spaces PDF
No ratings yet
Metric Spaces PDF
33 pages
Applied Numerical Computing
100% (1)
Applied Numerical Computing
257 pages
Taylor Classical Mechanics Homework
100% (1)
Taylor Classical Mechanics Homework
7 pages
MATH858D Markov Chains: Maria Cameron
No ratings yet
MATH858D Markov Chains: Maria Cameron
44 pages
Black - Derivations in Applied Mathematics
No ratings yet
Black - Derivations in Applied Mathematics
492 pages
Automata
100% (1)
Automata
17 pages
Statistical Mechanics Notes: Leonard Susskind's Lectures
No ratings yet
Statistical Mechanics Notes: Leonard Susskind's Lectures
21 pages
Random Matrix Theory
No ratings yet
Random Matrix Theory
65 pages
A Panorama of Harmonic Analysis
No ratings yet
A Panorama of Harmonic Analysis
61 pages
Applied and Computational Linear Algebra A First Course Charles L. Byrne
No ratings yet
Applied and Computational Linear Algebra A First Course Charles L. Byrne
469 pages
Fuzzy Min-Max Neural Networks
No ratings yet
Fuzzy Min-Max Neural Networks
32 pages
SQL: Queries, Programming, Triggers: CSC343 - Introduction To Databases - A. Vaisman 1
No ratings yet
SQL: Queries, Programming, Triggers: CSC343 - Introduction To Databases - A. Vaisman 1
32 pages
Geometrical Methods in The Theory of ODE
100% (1)
Geometrical Methods in The Theory of ODE
370 pages
Support Vector Machines: The Interface To Libsvm in Package E1071 by David Meyer FH Technikum Wien, Austria
No ratings yet
Support Vector Machines: The Interface To Libsvm in Package E1071 by David Meyer FH Technikum Wien, Austria
8 pages
Geometric Measure Theory by The Book - Notes, Articles and Books by Kevin R. Vixie
No ratings yet
Geometric Measure Theory by The Book - Notes, Articles and Books by Kevin R. Vixie
5 pages
MATH 545, Stochastic Calculus Problem Set 2: January 24, 2019
No ratings yet
MATH 545, Stochastic Calculus Problem Set 2: January 24, 2019
7 pages
Quant Interview Prep
No ratings yet
Quant Interview Prep
14 pages
Book QuantLib
No ratings yet
Book QuantLib
40 pages
45+ Behavioral Interview Questions in 2024 (+ Sample Answers)
No ratings yet
45+ Behavioral Interview Questions in 2024 (+ Sample Answers)
49 pages
MarkJoshi Advice
No ratings yet
MarkJoshi Advice
15 pages
Random Matrices
No ratings yet
Random Matrices
27 pages
CSC336 Midterm 1 Fall 2011
No ratings yet
CSC336 Midterm 1 Fall 2011
3 pages
Measure Theory and Fourier Analysis
No ratings yet
Measure Theory and Fourier Analysis
2 pages
Bode - Shannon - A Simplified Derivation of Linear Least Square Smoothing and Prediction Theory - 1950
No ratings yet
Bode - Shannon - A Simplified Derivation of Linear Least Square Smoothing and Prediction Theory - 1950
9 pages
Simplified Amplifier Analysis: Feedbacd
No ratings yet
Simplified Amplifier Analysis: Feedbacd
7 pages
Madhava MC Paper 12
No ratings yet
Madhava MC Paper 12
2 pages
Books For Quant Interviews
No ratings yet
Books For Quant Interviews
1 page
A Stronger Sylvester's Criterion For Positive Semidefinite Matrices
No ratings yet
A Stronger Sylvester's Criterion For Positive Semidefinite Matrices
18 pages
Wall Street Quant - Self-Assessment
No ratings yet
Wall Street Quant - Self-Assessment
3 pages
Complex analysis A Complete Guide
From Everand
Complex analysis A Complete Guide
Gerardus Blokdyk
No ratings yet
Generalized Functions and Partial Differential Equations
From Everand
Generalized Functions and Partial Differential Equations
Avner Friedman
No ratings yet
Lectures on the Coupling Method
From Everand
Lectures on the Coupling Method
Torgny Lindvall
No ratings yet

Assign1 APM462 S2016

Uploaded by

Assign1 APM462 S2016

Uploaded by

Assignment 1

APM462 – Nonlinear Optimization – Summer 2016

Question 1 Let f (x, y) = 2x2 + y 2 + xy − y

(a) Find a point satisfying the first order conditions for f

Solution Notice we may rewrite f as

∇f (x∗ ) = Qx∗ − b = 0 =⇒ x∗ = Q−1 b

One may now easily compute x∗ :

Question 2 Find all local minimum points for the function

Prove that your solution really is a global minimum.

Solution Notice we may rewrite f as

P (λ) = det(Q − 1λ) = −λ3 + 8λ2 − 18λ + 10

and the obvious bound of

pa (x) = a0 + a1 x + . . . + an xn = (x, a) & x = (1, x, . . . , xn )

(a) Show that f (a) can be written in the form

(a) Expanding out f (a), we see that

=(a, Qa) − 2(b, a) + c

F (tx1 + (1 − t)x2 ) 6 tF (x1 ) + (1 − t)F (x2 )

(a) By direct computation, we see

F (tx1 + (1 − t)x2 ) =f (g(tx1 + (1 − t)x2 )) definition of F

(b) First compute the gradient, we see that

∇2 F (x) = f 00 (g(x))∇g∇g T + f 0 (g(x))∇2 g

(y, ∇2 F (x) y) = f 00 (g(x))(y, ∇g∇g T (x) y) + f 0 (g(x))(y, ∇2 g(x) y) > 0 ∀y ∈ Rn

Question 5 Prove that if f1 and f2 are two convex functions on Rn , then

g(x) := max{f1 (x), f2 (x)}

Solution This is easy to verify directly:

g(tx1 + (1 − t)x2 ) = max{f1 (tx1 + (1 − t)x2 ), f2 (tx1 + (1 − t)x2 )}

You might also like