
Lecture 4

B1 Optimization

Michaelmas 2013

Convexity

Robust cost functions


Optimizing non-convex functions
grid search
branch and bound
multiple coverings
simulated annealing

A. Zisserman

The Optimization Tree

Unconstrained optimization
function of one variable f(x): min_x f(x)

[Figure: a 1D cost curve with a local minimum and a global minimum marked.]

down-hill search (gradient descent) algorithms can find local minima


which of the minima is found depends on the starting point
such minima often occur in real applications

How can you tell if an optimization has a single optimum?

The answer is: see if the optimization problem is convex.


If it is, then a local optimum is the global optimum.

First, we need to introduce


Convex Sets, and
Convex Functions

Note: sketch introduction only

[Figure: examples of a convex set and a non-convex set.]

Convex functions

Convex function examples

[Figure: example functions, labelled convex and not convex.]

A non-negative sum of convex functions is convex

Convex Optimization Problem


Minimize:
a convex function
over a convex set
Then locally optimal points are globally optimal

Also, such problems can be solved both in theory and practice

Why do we need the domain to be convex?

[Figure: a function f(x) over a domain that is not a convex set (two disjoint intervals). A downhill search started in one interval gets stuck at a local optimum; it can't reach the better optimum in the other interval.]

Examples of convex optimization problems


1. Linear programming
2. Least squares
   f(x) = ||Ax - b||^2, for any A
3. Quadratic functions
   f(x) = x^T P x + q^T x + r, provided that P is positive definite

Many more useful examples, see Boyd & Vandenberghe
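As a small illustration of the least-squares case (a sketch, not from the lecture; the matrix A and vector b below are made-up data), the global minimizer can be computed directly:

import numpy as np

# Least squares: minimize f(x) = ||Ax - b||^2, convex for any A.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 3))      # illustrative measurement matrix
b = rng.standard_normal(20)           # illustrative measurements

# The global minimizer solves the normal equations A^T A x = A^T b.
x_opt, residual, rank, sv = np.linalg.lstsq(A, b, rcond=None)
print("least-squares solution:", x_opt)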

Second order condition


The Hessian of a function f(x1, x2, ..., xn) is the n x n matrix of second partial derivatives, with entries

H_ij = ∂²f / ∂x_i ∂x_j,   i, j = 1, ..., n

Diagonalize the Hessian by an orthogonal change of coordinates.


Diagonals are the eigenvalues.
If the eigenvalues are all positive, then the Hessian is positive definite, and
f is convex.
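For instance (a minimal sketch for the quadratic example above; P is an arbitrary illustrative matrix), the second-order condition can be checked numerically:

import numpy as np

# For f(x) = x^T P x + q^T x + r with symmetric P, the Hessian is the constant matrix 2P.
P = np.array([[2.0, 0.5],
              [0.5, 1.0]])             # example matrix, chosen for illustration
H = 2 * P                              # Hessian of the quadratic

eigenvalues = np.linalg.eigvalsh(H)    # eigenvalues of the symmetric Hessian
print("Hessian eigenvalues:", eigenvalues)
print("f is convex:", bool(np.all(eigenvalues > 0)))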

Strictly convex
A function f(x) is strictly convex if
f((1 - θ) x0 + θ x1) < (1 - θ) f(x0) + θ f(x1)   for all x0 ≠ x1 and 0 < θ < 1.

strictly convex: one global optimum

not strictly convex: multiple local optima (but all are global)

Robust Cost Functions


In formulating an optimization problem there is often some
room for design and choice
The cost function can be chosen to be:
convex
robust to noise (outliers) in the data/measurements

Consider minimizing the cost function

f(x) = Σ_i (x - a_i)^2

for data { a_i }

the data { a_i } may be thought of as repeated measurements of a fixed value (at 0), subject to Gaussian noise, plus some outliers
here 10% of the data are outliers, biased towards the right of the true value
consequently the minimum of f(x) does not correspond to the true value

Examine the behaviour of various cost functions

f(x) = Σ_i C(|x - a_i|)

for C chosen as: quadratic, L1, truncated quadratic, Huber

Quadratic cost function


squared error: the usual default cost function
arises in Maximum Likelihood Estimation for Gaussian noise
convex

C(δ) = δ^2

Truncated Quadratic cost function


for inliers behaves as a quadratic
truncated so that outliers only incur a fixed cost
non-convex

C(δ) = min(δ^2, α^2)
     = δ^2 if |δ| < α,  α^2 otherwise

L1 cost function
absolute error
also called total variation
convex
non-differentiable at the origin
finds the median of { a_i }

C(δ) = |δ|

Huber cost function


hybrid between quadratic and L1
continuous first derivative
for small values is quadratic
for larger values becomes linear, so it has the outlier stability of L1
convex

C(δ) = δ^2             if |δ| < α
     = 2α|δ| - α^2     otherwise
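A compact sketch of these four per-residual costs (the threshold α is written as alpha; its value is a free choice, not given in the lecture):

import numpy as np

def quadratic(delta):
    return delta**2

def truncated_quadratic(delta, alpha=1.0):
    # quadratic for inliers, fixed cost alpha^2 for outliers
    return np.minimum(delta**2, alpha**2)

def l1(delta):
    return np.abs(delta)

def huber(delta, alpha=1.0):
    # quadratic near the origin, linear (with matched value and slope) beyond alpha
    d = np.abs(delta)
    return np.where(d < alpha, d**2, 2 * alpha * d - alpha**2)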


Example 1: measurements with outliers

f(x) = Σ_i C(|x - a_i|)

[Figure: f(x) for the quadratic, Huber and truncated quadratic costs on data { a_i } containing outliers, with a zoomed view near the minima.]

Example 2: bimodal measurements


70% in principal mode
30% in outlier mode

f(x) = Σ_i C(|x - a_i|)

[Figure: f(x) for the quadratic, truncated quadratic and Huber costs on the bimodal data { a_i }.]

Summary
Squared cost function: very susceptible to outliers
Truncated quadratic: has a stable minimum, but is non-convex and has other local minima; the basin of attraction of the global minimum is also limited
Huber: has a stable minimum and is convex
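To make the summary concrete, here is a sketch (synthetic data and assumed parameters, not from the lecture) comparing the minimizers of f(x) = Σ_i C(|x - a_i|) under each cost; the true value is 0 and 10% of the data are outliers to the right:

import numpy as np

rng = np.random.default_rng(1)
a = np.concatenate([rng.normal(0.0, 0.1, 90),   # inliers around the true value 0
                    rng.normal(2.0, 0.1, 10)])  # 10% outliers to the right

def huber(d, alpha=0.2):
    return np.where(d < alpha, d**2, 2 * alpha * d - alpha**2)

costs = {
    "quadratic": lambda d: d**2,
    "truncated quadratic": lambda d: np.minimum(d**2, 0.2**2),
    "L1": lambda d: d,
    "Huber": huber,
}

xs = np.linspace(-1.0, 3.0, 4001)               # brute-force search on a fine grid
for name, C in costs.items():
    f = np.array([C(np.abs(x - a)).sum() for x in xs])
    print(f"{name:20s} minimizer ~ {xs[np.argmin(f)]:+.3f}")

The quadratic minimizer is pulled towards the outliers (it is the mean), while the L1, Huber and truncated quadratic minimizers stay near the true value.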

Optimizing non-convex functions


function of one variable f(x): min_x f(x)

[Figure: the 1D cost curve again, with a local minimum and a global minimum marked.]

Sketch of four methods:

1. grid search: uniform grid covering of the space (a sketch follows this list)
2. branch and bound
3. multiple coverings: Newton-like methods within regions
4. simulated annealing: stochastic optimization
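A minimal sketch of method 1, grid search, on a made-up 1D function (the function and interval are illustrative, not from the lecture):

import numpy as np

def f(x):
    return 0.1 * x**2 + np.sin(3 * x)     # non-convex example with several local minima

xs = np.linspace(-5.0, 5.0, 1001)          # uniform grid covering the interval
fs = f(xs)
x_best = xs[np.argmin(fs)]
print("grid-search minimum near x =", x_best, ", f =", fs.min())

The best grid point can then be refined with a local downhill method.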

Branch and bound

min_x f(x)

[Figure: f(x) split into regions, with an upper bound (UB) and a lower bound (LB) marked for each region.]

Key idea:

Split the region into sub-regions and compute upper and lower bounds on f within each

Consider two regions A and C: if the lower bound of A is greater than the upper bound of C, then A can be discarded

Divide (branch) the remaining regions and repeat
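A sketch of the idea in 1D, under the added assumption that f has a known Lipschitz constant L (which gives a simple lower bound over each region; the function and numbers are illustrative):

import numpy as np

def f(x):
    return 0.1 * x**2 + np.sin(3 * x)

L = 4.0                                    # assumed Lipschitz constant of f
regions = [(-5.0, 5.0)]                    # start from the whole interval
best_ub = np.inf

for _ in range(20):                        # fixed number of branching rounds
    surviving = []
    for lo, hi in regions:
        mid = 0.5 * (lo + hi)
        ub = f(mid)                        # value at the midpoint is an upper bound on min f over the region
        lb = ub - L * (hi - lo) / 2        # Lipschitz lower bound on min f over the region
        best_ub = min(best_ub, ub)
        if lb <= best_ub:                  # discard regions whose lower bound exceeds the best upper bound
            surviving.append((lo, mid))    # branch: split the surviving region in two
            surviving.append((mid, hi))
    regions = surviving

print("best value found:", best_ub)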

Multiple coverings
Key idea is to cover the parameter space with overlapping
regions to deal with local optima, and then take advantage
of efficient continuous optimization for each region.
Example from the MATLAB Global Optimization Toolbox:

[Figure: "Pattern Contours with Constraint Boundaries" - contours of the objective with constraint boundaries plotted over a 2D parameter space.]
Multiple coverings ctd

Use multiple starting points

Continuous optimization method for each

Record the optimum for each starting point

Sort the values to find the global optimum
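A sketch of this multi-start strategy (scipy's general-purpose local minimizer stands in for the continuous method; the function and starting points are illustrative):

import numpy as np
from scipy.optimize import minimize

def f(x):
    return 0.1 * x[0]**2 + np.sin(3 * x[0])        # 1D non-convex example

starts = np.linspace(-5.0, 5.0, 11)                 # starting points covering the space
results = [minimize(f, [x0]) for x0 in starts]      # local (downhill) optimization from each
best = min(results, key=lambda r: r.fun)            # pick the best of the recorded optima
print("estimated global optimum: x =", best.x, ", f =", best.fun)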

Simulated Annealing

[Figure: a 1D cost function f(x) with a local minimum and a global minimum.]

The algorithm has a mechanism to jump out of local minima


It is a stochastic search method, i.e. it uses randomness in the search

Simulated annealing algorithm

At each iteration, propose a move in the parameter space

If the move decreases the cost, accept it

If the move increases the cost by ΔE, accept it with probability exp(-ΔE/T); otherwise, don't move

Note that the acceptance probability depends on the temperature T

Decrease the temperature according to a schedule, so that at the start cost increases are likely to be accepted, and at the end they are not
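A sketch of the algorithm on a 1D function (the proposal step size, initial temperature and cooling factor are illustrative choices, not values from the lecture):

import numpy as np

def f(x):
    return 0.1 * x**2 + np.sin(3 * x)

rng = np.random.default_rng(0)
x = 4.0                                   # starting point
T, alpha = 1.0, 0.995                     # initial temperature and cooling factor

for k in range(5000):
    x_new = x + rng.normal(0.0, 0.5)      # propose a random move
    dE = f(x_new) - f(x)
    if dE < 0 or rng.random() < np.exp(-dE / T):
        x = x_new                         # accept downhill moves; uphill moves with probability exp(-dE/T)
    T *= alpha                            # geometric cooling: T_{k+1} = alpha * T_k

print("simulated annealing estimate: x =", x, ", f(x) =", f(x))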

Boltzmann distribution and the cooling schedule

start with T high; then exp(-ΔE/T) is approximately 1, and all moves are accepted

many cooling schedules are possible, but the simplest is

T_{k+1} = α T_k,  0 < α < 1

where k is the iteration number

The algorithm can be very slow to converge

[Figure: the Boltzmann factor exp(-ΔE/T), plotted as exp(-x), exp(-x/10) and exp(-x/100) for x from 0 to 100.]

Simulated annealing
The name and inspiration come from annealing in metallurgy, a technique
involving heating and controlled cooling of a material to increase the size
of its crystals and reduce their defects.
The heat causes the atoms to become unstuck from their initial positions
(a local minimum of the internal energy) and wander randomly through
states of higher energy; the slow cooling gives them more chances of
finding configurations with lower internal energy than the initial one.
Algorithms due to: Kirkpatrick et al. 1982; Metropolis et al. 1953.

Example: Convergence of simulated annealing

[Figure: cost function C versus number of iterations. At INIT_TEMP moves are accepted almost unconditionally; as the temperature falls, uphill ("hill climbing") moves are accepted with probability exp(-ΔE/T); the run ends at FINAL_TEMP.]

Steepest descent on a graph

Slide from Adam Kalai and Santosh Vempala

Random Search on a graph

Simulated Annealing on a graph

Phase 1: Hot (Random)
Phase 2: Warm (Bias down)
Phase 3: Cold (Descent)

There is more
There are many other classes of optimization problem, and also many
efficient optimization algorithms developed for problems with special
structure. Examples include:
Combinatorial and discrete optimization
Dynamic programming
Max-flow/Min-cut graph cuts

See the links on the web page


http://www.robots.ox.ac.uk/~az/lectures/b1/index.html
and come to the C Optimization lectures next year
