SYS 6003: Optimization Fall 2016

Lecture 17
Instructor: Quanquan Gu Date: Oct 26th
Today we are going to study the projected gradient descent algorithm.
Consider the following constrained optimization problem:

$$\min_{x \in D} f(x). \tag{1}$$

If we apply the gradient descent algorithm directly, we cannot guarantee that in each iteration $x_{t+1} = x_t - \eta_t \nabla f(x_t)$ will be in $D$. In other words, we may end up with infeasible solutions. To ensure that the new point $x_{t+1}$ obtained in each iteration is always in $D$, one way is to project the new point back onto the feasible set.
Let us first define the projection of a point onto a set.
Definition 1 (Projection) The projection of a point $x$ onto a set $C$ is defined as $\Pi_C(x) := \arg\min_{y \in C} \frac{1}{2}\|x - y\|_2^2$.
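
For many simple convex sets the projection has a closed form. The following Python sketch is an illustration (not part of the lecture); it assumes NumPy, and the function names, radius $r$, and box bounds are arbitrary choices:

    import numpy as np

    def project_ball(x, r=1.0):
        # Projection onto the Euclidean ball {y : ||y||_2 <= r}:
        # if x is outside, rescale it onto the sphere of radius r.
        norm = np.linalg.norm(x)
        return x if norm <= r else (r / norm) * x

    def project_box(x, lo=-1.0, hi=1.0):
        # Projection onto the box {y : lo <= y_i <= hi}: clip coordinate-wise.
        return np.clip(x, lo, hi)

    x = np.array([3.0, 4.0])
    print(project_ball(x))  # [0.6, 0.8]: rescaled onto the unit sphere
    print(project_box(x))   # [1.0, 1.0]: each coordinate clipped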

Theorem 1 (Projection Theorem) Let $C \subseteq \mathbb{R}^d$ be a convex set. For any $x \in \mathbb{R}^d$ and $y \in C$, it holds that

(1) $(\Pi_C(x) - y)^\top (\Pi_C(x) - x) \le 0$;

(2) $\|\Pi_C(x) - y\|_2^2 + \|\Pi_C(x) - x\|_2^2 \le \|x - y\|_2^2$.

Proof: (1) Let $f(y) = \frac{1}{2}\|x - y\|_2^2$. By the first-order necessary condition at the local minimum $y^* = \Pi_C(x)$, we have $\nabla f(y^*)^\top d \ge 0$ for any feasible direction $d$ at $y^*$. Let $d = y - \Pi_C(x)$. For any $y \in C$, it then follows that

$$\nabla f(y^*)^\top (y - \Pi_C(x)) \ge 0. \tag{2}$$

Note that $\nabla f(y^*) = -(x - y^*) = y^* - x$ and $y^* = \Pi_C(x)$. From (2), it then follows that $(\Pi_C(x) - x)^\top (y - \Pi_C(x)) \ge 0$, i.e.,

$$(\Pi_C(x) - x)^\top (\Pi_C(x) - y) \le 0.$$

(2) We have

$$\begin{aligned}
\|x - y\|_2^2 &= \|x - \Pi_C(x) + \Pi_C(x) - y\|_2^2 \\
&= \|x - \Pi_C(x)\|_2^2 + \|\Pi_C(x) - y\|_2^2 - 2(\Pi_C(x) - y)^\top (\Pi_C(x) - x) \\
&\ge \|x - \Pi_C(x)\|_2^2 + \|\Pi_C(x) - y\|_2^2,
\end{aligned}$$

where the inequality follows from part (1). This completes the proof.

Remark 1 Geometrically, the projection theorem says that the angle between the vectors $y - \Pi_C(x)$ and $\Pi_C(x) - x$ is either acute or right.

Algorithm 1 Projected Gradient Descent
1: Input: $\eta_t$
2: Initialize: $x_1 \in D$
3: for $t = 1$ to $T - 1$ do
4:   $x_{t+1} = \Pi_D[x_t - \eta_t \nabla f(x_t)]$
5: end for

So we modify the updating rule of gradient descent to be $x_{t+1} = \Pi_D[x_t - \eta_t \nabla f(x_t)]$, where $\Pi_D(x)$ is the projection of $x$ onto $D$. Then we have the projected gradient descent algorithm shown in Algorithm 1. It is worth noting that if the gradient of $f$ does not exist at $x_t$, then in the fourth line of Algorithm 1 we can use any subgradient of $f$ at $x_t$ instead of its gradient.
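
A minimal Python sketch of Algorithm 1 follows; this is an illustration, not from the lecture, and the names projected_gradient_descent, grad, and proj are assumptions. It uses the step size $\eta_t = 1/\sqrt{t}$ analyzed in Theorem 2 below:

    import numpy as np

    def projected_gradient_descent(grad, proj, x1, T):
        # Algorithm 1: x_{t+1} = Pi_D[x_t - eta_t * grad f(x_t)].
        # grad(x) may return any subgradient of f at x if f is not
        # differentiable there; proj(x) computes the projection Pi_D(x).
        xs = [np.asarray(x1, dtype=float)]
        for t in range(1, T):
            eta = 1.0 / np.sqrt(t)  # step size eta_t = 1/sqrt(t), see Theorem 2
            xs.append(proj(xs[-1] - eta * grad(xs[-1])))  # line 4 of Algorithm 1
        return xs  # iterates x_1, ..., x_T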
The following theorem provides the convergence rate for the projected gradient descent
algorithm.

Theorem 2 Suppose that $f$ is a convex function, and its subgradient $g(x)$ is bounded by $G$, i.e., $\|g(x)\|_2 \le G$, for any $x \in D$. Then for the projected gradient descent with $\eta_t = 1/\sqrt{t}$, it holds that

$$f\left(\frac{1}{T}\sum_{t=1}^{T} x_t\right) - f(x^*) \le \left(\frac{R^2}{2} + G^2\right)\frac{1}{\sqrt{T}},$$

where $x^*$ is the optimal solution to problem (1) and $R = \max_{x,y \in D} \|x - y\|_2$ is the diameter of the convex set $D$.
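
As an illustrative check of this rate (again not part of the lecture), the two sketches above can be combined on a problem whose solution is known: minimize $f(x) = \|x - c\|_2^2$ over the unit ball with $c$ outside the ball, so that $x^* = c/\|c\|_2$. The instance below is an arbitrary choice and reuses project_ball and projected_gradient_descent from the earlier sketches:

    c = np.array([2.0, 0.0])
    f = lambda x: np.sum((x - c) ** 2)   # convex, gradient bounded on the unit ball
    grad = lambda x: 2.0 * (x - c)
    xs = projected_gradient_descent(grad, project_ball, x1=np.zeros(2), T=1000)
    x_avg = np.mean(xs, axis=0)          # Theorem 2 bounds f at the averaged iterate
    print(f(x_avg) - f(np.array([1.0, 0.0])))  # optimality gap, shrinking like 1/sqrt(T)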
