Concave + Convex
Concave + Convex
B1 Optimization
Michaelmas 2013
Convexity
A. Zisserman
Unconstrained optimization
function of one
variable
f (x)
min f (x)
x
local
minimum
global
minimum
convex
Not convex
Convex functions
convex
Not convex
f (x)
local
optimum
2f
x1 x1
2f
= x2.x1
2f
xn x1
2f
2f
x1 x2
2f
x2 x2
. . . x xn
1
2f
. . . x xn
2
..
...
2f
xn x2
2f
xn xn
..
...
Strictly convex
A function f (x) is strictly convex if
f ((1 )x0 + x1) < (1 )f (x0) + f (x1) .
strictly convex
one global optimum
f (x) =
X
i
(x ai)2
{ ai }
X
i
quadratic
L1
C(|x ai|)
truncated quadratic
huber
C()
function
arises in Maximum Likelihood
Estimation for Gaussian noise
convex
C() = 2
C() = min( 2, )
=
2
if || <
otherwise.
L1 cost function
absolute error
C()
C() = ||
C() =
2
if || <
2|| 2 otherwise.
{ ai }
f (x) =
X
i
quadratic
C(|x ai|)
huber
truncated quadratic
zoom
{ ai }
f (x) =
X
i
quadratic
C(|x ai|)
truncated quadratic
huber
Summary
Squared cost function very susceptible to outliers
Truncated quadratic has a stable minimum, but is non-convex and
also has other local minima. Also basin of attraction of global minimum
limited
Huber has stable minimum and is convex
f (x)
min f (x)
x
local
minimum
global
minimum
1.
2.
3.
4.
min f (x)
x
f (x)
UB
LB
Key idea:
Multiple coverings
Key idea is to cover the parameter space with overlapping
regions to deal with local optima, and then take advantage
of efficient continuous optimization for each region.
Example from Matlab Global Optimization toolbox
Pattern Contours with Constraint Boundaries
5
4
3
2
1
0
-1
-2
-3
-4
-3
-2
-1
2
x
Simulated Annealing
f (x)
1
local
minimum
global
minimum
exp(-x)
exp(-x/10)
exp(-x/100)
0.8
0.6
0.4
0.2
0
0
20
40
60
80
100
Simulated annealing
The name and inspiration come from annealing in metallurgy, a technique
involving heating and controlled cooling of a material to increase the size
of its crystals and reduce their defects.
The heat causes the atoms to become unstuck from their initial positions
(a local minimum of the internal energy) and wander randomly through
states of higher energy; the slow cooling gives them more chances of
finding configurations with lower internal energy than the initial one.
Algorithms due to: Kirkpatrick et al. 1982; Metropolis et al.1953.
AT INIT_TEMP
unconditional Acceptance
COST FUNCTION, C
HILL CLIMBING
HILL CLIMBING
AT FINAL_TEMP
NUMBER OF ITERATIONS
There is more
There are many other classes of optimization problem, and also many
efficient optimization algorithms developed for problems with special
structure. Examples include:
Combinatorial and discrete optimization
Dynamic programming
Max-flow/Min-cut graph cuts