
10-725/36-725: Convex Optimization Fall 2016

Lecture 14: Newton’s Method


Lecturer: Javier Peña Scribes: Varun Joshi, Xuan Li

Note: LaTeX template courtesy of UC Berkeley EECS dept.


Disclaimer: These notes have not been subjected to the usual scrutiny reserved for formal publications.
They may be distributed outside this class only with the permission of the Instructor.

14.1 Review of previous lecture


Given a function f : Rⁿ → R, we define its conjugate f* : Rⁿ → R as

    f*(y) = maxₓ (yᵀx − f(x))

Some properties of the convex conjugate of a function are as follows:

• The conjugate f* is always convex (regardless of the convexity of f)


• When f is quadratic with Q ≻ 0, f* is quadratic in Q⁻¹, i.e., for f(x) = ½xᵀQx + bᵀx with Q ≻ 0, f*(y) = ½(y − b)ᵀQ⁻¹(y − b)
• When f is a norm, f* is the indicator of the dual norm unit ball
• When f is closed and convex, x ∈ ∂f*(y) ⇐⇒ y ∈ ∂f(x)

A key result that helps us write down the dual in terms of the conjugate is Fenchel duality:

    Primal: minₓ f(x) + g(x)
    Dual:   maxᵤ −f*(u) − g*(−u)

14.2 Introduction
In this section, we present Newton's method and show that it can be interpreted as minimizing a quadratic
approximation to a function at a point. We also briefly discuss the origin of Newton’s method and how it
can be used for finding the roots of a vector-valued function.

14.2.1 Newton’s Method


Newton’s method is a second-order method in the setting where we consider the unconstrained, smooth
convex optimization problem
    minₓ f(x)

where f is convex, twice differentiable and dom(f ) = Rn .

Newton’s method: choose initial x(0) ∈ Rn , and


    x(k) = x(k−1) − (∇²f(x(k−1)))⁻¹ ∇f(x(k−1)),  k = 1, 2, 3, . . .

Lecture 14: October 19

This is called pure Newton’s method since there is no concept of a step-size involved. In Newton’s method,
we move in the direction of the negative Hessian inverse times the gradient. Compare this to gradient descent
where we move in the direction of the negative gradient: choose initial x(0) ∈ Rn , and
    x(k) = x(k−1) − tₖ ∇f(x(k−1)),  k = 1, 2, 3, . . .
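The pure Newton update can be sketched in a few lines of code. The sketch below applies it to the example function f(x) = (10x₁² + x₂²)/2 + 5 log(1 + e^(−x₁−x₂)) that appears later in these notes, with the gradient and Hessian derived by hand; the function names, starting point, and iteration count are our own choices:

```python
import numpy as np

def pure_newton(grad, hess, x0, n_iter=20):
    """Pure Newton's method: x+ = x - (hess f)^{-1} grad f, no step size."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        # Solve hess(x) v = grad(x) rather than forming the inverse explicitly.
        x = x - np.linalg.solve(hess(x), grad(x))
    return x

# Example: minimize f(x) = (10 x1^2 + x2^2)/2 + 5 log(1 + exp(-x1 - x2)).
def grad(x):
    s = 5 * np.exp(-x[0] - x[1]) / (1 + np.exp(-x[0] - x[1]))
    return np.array([10 * x[0] - s, x[1] - s])

def hess(x):
    e = np.exp(-x[0] - x[1])
    h = 5 * e / (1 + e) ** 2          # second derivative of the log term
    return np.array([[10 + h, h], [h, 1 + h]])

x_star = pure_newton(grad, hess, [1.0, 1.0])
```

Solving the linear system instead of inverting the Hessian is both cheaper and numerically more stable.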

14.2.2 Newton’s method interpretation


Newton’s method can be interpreted as minimizing a quadratic approximation to a function at a given
−1
point. The step x+ = x − ∇2 f (x) ∇f (x) can be obtained by minimizing over y the following quadratic
approximation:
1
f (y) ≈ f (x) + ∇f (x)T (y − x) + (y − x)T ∇2 f (x)(y − x)
2
On the other hand, the gradient descent step x⁺ = x − t∇f(x) can be obtained by minimizing over y the following quadratic approximation:

    f(y) ≈ f(x) + ∇f(x)ᵀ(y − x) + (1/2t) ‖y − x‖₂²
As we can see, Newton's method minimizes a finer quadratic approximation to the function than gradient descent does. For example, for minimizing the function f(x) = (10x₁² + x₂²)/2 + 5 log(1 + e^(−x₁−x₂)), a comparison of the steps taken by Newton's method and gradient descent is provided in Figure 14.1. The figure

Figure 14.1: Comparison of Newton’s Method (blue) with Gradient Descent (black)

shows a contrast between the behaviour of Newton’s method and gradient descent. In gradient descent the
direction of steps is always perpendicular to the level curves, while that is not the case in Newton's method (due to the Hessian term).
For a quadratic function, one step of Newton's method minimizes the function directly, because the quadratic approximation of a quadratic function is the function itself.
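This observation is easy to verify numerically: for a strongly convex quadratic f(x) = ½xᵀQx + bᵀx, one pure Newton step from an arbitrary point lands exactly on the minimizer x⋆ = −Q⁻¹b. A minimal sketch (the random test data is our own):

```python
import numpy as np

rng = np.random.default_rng(0)
# Random strongly convex quadratic f(x) = 0.5 x^T Q x + b^T x  (Q > 0).
A = rng.standard_normal((5, 5))
Q = A @ A.T + 5 * np.eye(5)
b = rng.standard_normal(5)

x = rng.standard_normal(5)                 # arbitrary starting point
grad = Q @ x + b                           # gradient of the quadratic at x
x_plus = x - np.linalg.solve(Q, grad)      # one pure Newton step

x_star = -np.linalg.solve(Q, b)            # exact minimizer of f
```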

14.2.3 Newton’s method for root finding


Newton’s method was originally developed by Newton (1685) and Raphson (1690) for finding roots of poly-
nomials. This was later generalized to minimization of nonlinear equations by Simpson (1740). Suppose
F : Rn → Rn is a differentiable vector-valued function and consider the system of equations
F (x) = 0

Then, the Newton’s method for finding the solution to this system of equations is: choose initial x(0) ∈ Rn ,
and 0 −1
x(k) = x(k−1) − F (x(k−1) ) F (x(k−1) ), k = 1, 2, 3, . . .
0
where F (x) is the Jacobian matrix of F at x.
0
The Newton step x+ = x − F (x)−1 F (x) can be obtained by solving over y the linear approximation
0
F (y) ≈ F (x) + F (x)(y − x) = 0
Newton’s method for root finding is directly related to the Newton’s method for convex minimization. In
particular, newton’s method for
    minₓ f(x)
is the same as Newton’s method for finding the roots of
∇f (x) = 0.
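In the scalar case the Jacobian reduces to the ordinary derivative and we recover the classical Newton–Raphson iteration x⁺ = x − F(x)/F′(x). A minimal sketch (the function names are ours) computing √2 as a root of F(x) = x² − 2:

```python
def newton_root(F, Fprime, x0, n_iter=30):
    """Newton's method for a scalar root: x+ = x - F(x)/F'(x)."""
    x = x0
    for _ in range(n_iter):
        x = x - F(x) / Fprime(x)
    return x

# Example: root of F(x) = x^2 - 2, i.e. computing sqrt(2).
root = newton_root(lambda x: x * x - 2.0, lambda x: 2.0 * x, x0=1.0)
```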

14.3 Properties
In this section, we present two key properties of Newton’s method which distinguish it from first order
methods.

14.3.1 Affine Invariance


Assume f : Rⁿ → R is twice differentiable and A ∈ Rⁿˣⁿ is nonsingular. Let g(y) := f(Ay). Then the Newton step for g at a point y is given by

    y⁺ = y − (∇²g(y))⁻¹ ∇g(y)

For the affine transformation x = Ay, it turns out that the Newton step for f at the point x is x⁺ = Ay⁺.
This means that the progress of Newton's method is independent of linear changes of coordinates. This property does not hold for gradient descent.
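The identity x⁺ = Ay⁺ can be checked numerically. Below we take a smooth non-quadratic test function of our own choosing, form the gradient and Hessian of g(y) = f(Ay) via the chain rule (∇g(y) = Aᵀ∇f(Ay), ∇²g(y) = Aᵀ∇²f(Ay)A), and compare the two Newton steps:

```python
import numpy as np

# A smooth, non-quadratic test function with closed-form derivatives:
# f(x) = sum(exp(x)) + ||x||^2 / 2.
def grad_f(x):
    return np.exp(x) + x

def hess_f(x):
    return np.diag(np.exp(x) + 1.0)

A = np.array([[2.0, 1.0], [0.5, 3.0]])     # nonsingular change of variables

y = np.array([0.3, -0.7])
x = A @ y                                   # corresponding point for f

# Newton step for g(y) = f(Ay), using the chain rule for grad and Hessian.
grad_g = A.T @ grad_f(x)
hess_g = A.T @ hess_f(x) @ A
y_plus = y - np.linalg.solve(hess_g, grad_g)

# Newton step for f at x = Ay; affine invariance says x_plus == A @ y_plus.
x_plus = x - np.linalg.solve(hess_f(x), grad_f(x))
```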

14.3.2 Local Convergence


Newton’s method has the property of local convergence. The formal statement of the property is as follows.
Theorem 14.1 Assume F : Rⁿ → Rⁿ is continuously differentiable and x⋆ ∈ Rⁿ is a root of F, that is, F(x⋆) = 0, such that F′(x⋆) is nonsingular. Then

(a) There exists δ > 0 such that if ‖x(0) − x⋆‖ < δ then Newton's method is well defined and

    lim_{k→∞} ‖x(k+1) − x⋆‖ / ‖x(k) − x⋆‖ = 0.

(b) If F′ is Lipschitz continuous in a neighbourhood of x⋆ then there exists K > 0 such that

    ‖x(k+1) − x⋆‖ ≤ K ‖x(k) − x⋆‖².

Part (a) of the theorem says that Newton's method has superlinear local convergence. Note that this is stronger than linear convergence: x(k) → x⋆ linearly ⇐⇒ ‖x(k+1) − x⋆‖ ≤ c‖x(k) − x⋆‖ for some c ∈ (0, 1). If we further assume that F′ is Lipschitz continuous, then from part (b) we get that Newton's method has local quadratic convergence, which is even stronger than superlinear convergence.

Note that the above theorem concerns only local convergence, so it applies only when we start close to the root. Newton's method does not necessarily converge globally.

14.4 Newton Decrement


For a smooth, convex function f, the Newton decrement at a point x is defined as

    λ(x) = (∇f(x)ᵀ (∇²f(x))⁻¹ ∇f(x))^(1/2)

For an unconstrained convex optimization problem


    minₓ f(x)

there are two ways to interpret the Newton Decrement.

Interpretation 1: The Newton decrement measures the difference between f(x) and the minimum of its quadratic approximation:

    f(x) − min_y { f(x) + ∇f(x)ᵀ(y − x) + ½(y − x)ᵀ∇²f(x)(y − x) }
        = ½ ∇f(x)ᵀ (∇²f(x))⁻¹ ∇f(x) = ½ λ(x)²
Thus, we can think of λ(x)²/2 as an approximate bound on the suboptimality gap f(x) − f⋆. The bound is approximate because we are considering only the minimum of the quadratic approximation, not the actual minimum of f.
Interpretation 2: If the step in Newton's method is denoted by v = −(∇²f(x))⁻¹ ∇f(x), then

    λ(x) = (vᵀ ∇²f(x) v)^(1/2) = ‖v‖_{∇²f(x)}

Thus, λ(x) is the length of the Newton step in the norm defined by the Hessian.

Fact: The Newton decrement is affine invariant, i.e., for g(y) = f(Ay) with nonsingular A, λ_g(y) = λ_f(x) at x = Ay.
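Both interpretations can be checked numerically. For a quadratic, the quadratic approximation is exact, so λ(x)²/2 equals the suboptimality gap f(x) − f⋆ exactly (the test data below is our own):

```python
import numpy as np

# Quadratic test function f(x) = 0.5 x^T Q x + b^T x with constant Hessian Q.
Q = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, -2.0])
x = np.array([0.5, 0.5])

g = Q @ x + b                                  # gradient at x
H = Q                                          # Hessian (constant here)

# Definition: lambda(x) = (grad^T H^{-1} grad)^{1/2}.
lam = np.sqrt(g @ np.linalg.solve(H, g))

# Interpretation 2: lambda(x) is the Hessian-norm length of the Newton step v.
v = -np.linalg.solve(H, g)
lam_via_step = np.sqrt(v @ H @ v)

# Interpretation 1 (exact for a quadratic): f(x) - f* = lambda(x)^2 / 2,
# since the quadratic approximation of a quadratic is the function itself.
f = lambda z: 0.5 * z @ Q @ z + b @ z
x_star = -np.linalg.solve(Q, b)
gap = f(x) - f(x_star)
```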

14.5 Convergence Analysis for backtracking line search


14.5.1 Introduction to algorithm
The pure Newton’s Method does not always converge, depending on the staring point. Thus, damped
Newton’s method is introduced to work together with pure Newton Method. With 0 < α ≤ 21 and 0 < β < 1,
at each iteration we start with t = 1, and while
f (x + tv) <= f (x) + αt∇f (x)T v
we perform the the Newton update, else we shrink t = βt. Here
−1
v = − ∇2 f (x) ∇f (x)
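A minimal sketch of damped Newton with backtracking, applied to a strongly convex test function of our own choosing (the parameter values α = 0.25, β = 0.5 and iteration count are illustrative):

```python
import numpy as np

def newton_backtracking(f, grad, hess, x0, alpha=0.25, beta=0.5, n_iter=50):
    """Damped Newton: backtrack from t = 1 until the sufficient-decrease
    condition f(x + t v) <= f(x) + alpha * t * grad^T v holds."""
    x = np.asarray(x0, dtype=float)
    for _ in range(n_iter):
        g = grad(x)
        v = -np.linalg.solve(hess(x), g)        # Newton direction
        t = 1.0
        while f(x + t * v) > f(x) + alpha * t * (g @ v):
            t *= beta                            # shrink until decrease suffices
        x = x + t * v
    return x

# Example: minimize f(x) = sum(exp(x)) + ||x||^2 / 2 (strongly convex).
f = lambda x: np.sum(np.exp(x)) + 0.5 * x @ x
grad = lambda x: np.exp(x) + x
hess = lambda x: np.diag(np.exp(x) + 1.0)

x_min = newton_backtracking(f, grad, hess, x0=np.array([2.0, -3.0]))
```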

14.5.2 Example: logistic regression


In lecture we are given a logistic regression example with n = 500 and p = 100. With backtracking, Newton's method is compared with gradient descent, and the convergence curves are shown in Figure 14.2. It is seen that Newton's method has a different regime of convergence. Notice that the comparison might be unfair, since the per-iteration computational cost of the two methods differs significantly.

Figure 14.2: Comparison of Newton’s Method with Gradient Descent (backtracking)

14.5.3 Convergence analysis


Under the assumptions that

• f is strongly convex with parameter m, twice differentiable, ∇f is Lipschitz with parameter L, and dom(f) = Rⁿ

• ∇²f is Lipschitz with parameter M

Newton's method with backtracking line search satisfies the following convergence bound:

    f(x(k)) − f⋆ ≤ f(x(0)) − f⋆ − γk                  if k ≤ k₀
    f(x(k)) − f⋆ ≤ (2m³/M²) (1/2)^(2^(k−k₀+1))        if k > k₀

where γ = αβ²η²m/L², η = min{1, 3(1 − 2α)} m²/M, and k₀ is the number of steps until ‖∇f(x(k₀+1))‖₂ < η.
More precisely, the result indicates that in the damped phase we have

    f(x(k+1)) ≤ f(x(k)) − γ

i.e., the function value decreases by at least γ per step. In the pure phase, backtracking selects t = 1, and we have

    (M/2m²) ‖∇f(x(k+1))‖₂ ≤ ( (M/2m²) ‖∇f(x(k))‖₂ )²

Also, once we enter the pure phase, we do not leave it.
Finally, to reach f(x(k)) − f⋆ ≤ ε, at most

    (f(x(0)) − f⋆)/γ + log log(ε₀/ε)

iterations are needed, where ε₀ = 2m³/M².
The "log log" term in the convergence result reflects quadratic convergence. However, the quadratic convergence result is only local: it is guaranteed only in the second, or pure, phase. Finally, the above bound depends on L, m, M, but the algorithm itself does not.

14.6 Convergence Analysis for self concordant functions


14.6.1 Definition
To achieve a scale-free analysis we introduce self-concordant functions. A function f : R → R is self-concordant if it is convex on an open interval of R and satisfies

    |f‴(x)| ≤ 2 f″(x)^(3/2)

(for f : Rⁿ → R the condition is required along every line). Two examples are f(x) = −Σᵢ₌₁ⁿ log(xᵢ) and f(X) = −log det(X).
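For the first example in one dimension, f(x) = −log x, we have f″(x) = 1/x² and f‴(x) = −2/x³, so the defining inequality in fact holds with equality. A quick numerical check (the sample points are our own):

```python
# For f(x) = -log(x): f''(x) = 1/x^2 and f'''(x) = -2/x^3, so the
# self-concordance inequality |f'''(x)| <= 2 f''(x)^{3/2} holds with equality.
checks = []
for x in [0.1, 1.0, 3.7, 50.0]:
    f2 = 1.0 / x ** 2
    f3 = -2.0 / x ** 3
    checks.append(abs(f3) <= 2.0 * f2 ** 1.5 + 1e-12)   # tolerance for rounding
```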

14.6.2 Property
If g is self-concordant and A, b are of appropriate dimensions, then

    f(x) := g(Ax − b)

is also self-concordant.

14.6.3 Convergence Analysis


For a self-concordant function f, Newton's method with backtracking line search needs at most

    C(α, β)(f(x(0)) − f⋆) + log log(1/ε)

iterations to achieve f(x(k)) − f⋆ ≤ ε, where C(α, β) is a constant depending only on the backtracking parameters α, β.

14.7 Comparison to first order methods


14.7.1 High-level comparison
• Memory : Each iteration of Newton’s method requires O(n2 ) storage due to the n×n Hessian whereas
each gradient iteration requires O(n) storage for the n-dimensional gradient.
• Computation : Each Newton iteration requires O(n³) flops, as it solves a dense n × n linear system. Each gradient descent iteration requires O(n) flops for scaling and adding n-dimensional vectors.
• Backtracking : Backtracking line search has roughly the same cost for both methods, which use O(n)
flops per inner backtracking step.
• Conditioning : Newton’s method is not afected by a problem’s conditioning(due to affine invariance),
but gradient descent can seriously degrade, since it depends adversely on the condition number.
• Fragility : Newton’s method may be empirically more sensitive to bugs/numerical errors, whereas
gradient descent is more robust.

Even though Newton's method enjoys quadratic convergence compared to the linear convergence of gradient descent, computing the Hessian might make each iteration a lot slower. If the Hessian is sparse and structured (e.g. banded), then both memory and computation are O(n).

14.8 Equality-constrained Newton’s method


14.8.1 Introduction
Suppose now we have problems with equality constraints

    minₓ f(x)  subject to  Ax = b

Here we have three options: eliminate the equality constraints by writing x = Fy + x₀, where the columns of F span the null space of A and Ax₀ = b; derive the dual; or use the most straightforward option, equality-constrained Newton's method.

14.8.2 Definition
In equality-constrained Newton's method, we take Newton steps that are confined to the affine set defined by the constraints. The Newton update is now x⁺ = x + tv, where

    v = argmin_{A(x+z)=b} { f(x) + ∇f(x)ᵀz + ½ zᵀ∇²f(x)z }

From the KKT conditions it follows that for some w we have

    [ ∇²f(x)  Aᵀ ] [ v ]      [ ∇f(x)  ]
    [   A     0  ] [ w ]  = − [ Ax − b ]

The latter is the root-finding Newton step for the KKT conditions of the original equality-constrained problem:

    [ ∇f(x) + Aᵀy ]   [ 0 ]
    [   Ax − b    ] = [ 0 ]

References
• S. Boyd and L. Vandenberghe (2004), "Convex Optimization", Chapters 9 and 10
• O. Güler (2010), "Foundations of Optimization", Chapter 14
• Y. Nesterov (1998), "Introductory Lectures on Convex Optimization: A Basic Course", Chapter 2
• Y. Nesterov and A. Nemirovskii (1994), "Interior-Point Polynomial Methods in Convex Programming", Chapter 2
• J. Nocedal and S. Wright (2006), "Numerical Optimization", Chapters 6 and 7
• L. Vandenberghe, Lecture notes for EE 236C, UCLA, Spring 2011–2012
