Approximating NP-hard Problems: Efficient Algorithms and Their Limits
Approximating NP-hard Problems: Efficient Algorithms and Their Limits
Prasad Raghavendra
University of Washington
Seattle
Combinatorial Optimization
Problems
!
Set Cover Max 3 SAT Steiner Tree
!
d
a rd
r
Vertex P xH
( x1 Cover MultiCut
2 x3 )( x2 x 3 x5 )( x 2 x3 x5 )( x 5 x4 x1 )
N
a
Find an assignment that satisfies the
Max 3 SAT
H
Max Cut
maximum number of clauses. Label Cover
P
Max Di Cut Sparsest Cut
Multiway Cut
Max 2 SAT
N Metric TSP Max 4 SAT
Approximation Algorithms
Can
An we find
algorithm α-approximation
A isaansolution that is for
say
a problem if for every instance I,
half as good as optimum?
A(I) ≥ α ∙ OPT(I)
--Vast Literature--
The Tools
Till 1994,
A majority of approximation algorithms directly or
indirectly relied on Linear Programming.
In 1994,
Semidefinite Programming based
algorithm for Max Cut
[Goemans-Williamson]
[Charikar-Makarychev-Makarychev] p /( 2 )
[Chlamtac-Makarychev-Makarychev] 1 O( log n log p )
[Arora-Khot-Kolla-Steurer-Tulsiani-Vishnoi] 1
1 log
Hardness Results:
No constant factor approximation for unique games. [Feige-Reichman]
UGC HARD NP HARD
Assuming UGC
MAX k-CSP UGC Hardness
Unique Games Results
MAX 3-CSP [Khot-Kindler-Mossel-O’donnell]
MAX 3-AND [Austrin 06]
For MaxCut, Max-2-SAT, [Austrin 07]
MAX 3-MAJ
Unique Games
MAX E2 LIN3
based hardness
[Khot-Odonnell]
[Odonnell-Wu]
MAX 3 DI-CUT = [Samorodnitsky-Trevisan]
approximation
MAX 4-SAT obtained by Semidefinite programming!
MAX DI CUT
MAX 3-SAT
MAX CUT
MAX 2-SAT
MAX Horn SAT
MAX k-CUT
0 1
The Connection
MAX k-CSP
Unique Games UGC Hard
How General a CSP? MAX 3-CSP How Simple an SDP?
Theorem: [Raghavendra 08]
MAX 3-AND
Assuming Unique Games MAX
Conjecture, For takes
3-MAJ
everynear
CSP,linear
Can Specify
Theorem: [Raghavendra08]
“the10%simplest semidefinite
of 3-Clauses programs
MAX E2 LIN3 give the
timebest
in the size of
Aapproximation
generic
70% algorithm that
computable
of Cut constraints is optimal for every
efficiently.” CSP under
UGC! GENERIC
MAX 3 DI-CUT
20% of 2-SAT constraints (at least
the CSP.
MAX as good as all known
4-SAT algorithms)
(techniques from
ALGORITHM
MAX DI CUT [Arora-Kale])
MAX 3-SAT
MAX CUT
MAX 2-SAT
MAX Horn SAT
MAX k-CUT
3-way Cut
B
3-Way Cut:
10 “Separate the 3-terminals
15
A 1
7 while separating the
1 minimum number of edges”
3
C A generalization of the
classic s-t cut problem
B
[Karger-Klein-Stein-Thorup-Young]
A 12/11 factor approximation algorithm
for 3-Way Cut
A C
Graph Labelling Problems
ALGORITHMS
Generalizations of 3-Way Cut [Calinescu-Karloff-Rabani 98]
• k-Way Cut [Chekuri-Khanna-Naor-Zosin]
[Calinescu-Karloff-Rabani 01]
• 0-Extension [Gupta-Tardos]
[Karger-Klein-Stein-Thorup-Young]
• Class of Metric Labelling Problems [Kazarnov 98]
[Kazarnov 99]
[Kleinberg-Tardos]
Theorem: [Manokaran-Naor-Raghavendra-Schwartz]
Assuming Unique Games Conjecture,
The “earthmover linear program” gives the best
approximation for every graph labelling problem.
Ranking Teams?
Maximum
Rank teams Acyclic Subgraph
so that result of
“Given a directed
maximum numbergraph,
of
orderagrees
games the vertices to
with the
maximize the number of
ranking
forward edges.”
Theorem: [Guruswami-Manokaran-Raghavendra]
Assuming Unique Games Conjecture,
The best algorithm’s output is as good as a random ordering.
More generally,
Theorem: [Guruswami-Manokaran-Raghavendra]
Assuming Unique Games Conjecture, For every Ordering CSP,
a simple SDP relaxation gives the best approximation.
The UG Barrier
If UGC is true,
Constraint Satisfaction
Problems
Then Simplest SDPs
give the best
approximation
Graph Labelling Problems UGC possible.
HARD
If UGC is false,
Ordering CSPs
Hopefully, a new
Kernel Clustering Problems
algorithmic
technique will arise.
Grothendieck Problem
Even if UGC is false
1 Quadratic Program
-1 Semidefinite Program
-1
1 10
15
-1 Variables : vx1 , xv2 … xvn
1 7
1
1 1
-1
x|i =vi1|2or= -1
1
-1 -1
3
1
-1
Maximize w
4 (i , j )E
ij (
| x
vii x
v j )| 2
Maximizing
[Raghavendra-Steurer 09]
Adding all valid constraints on at most
Possibility:
2^O((loglogn)1/4 ) variables to the simple SDP does not
disprove the Unique Games Conjecture
Adding a simple constraint on every 5 variables
yields a better approximation for MaxCut,
Grothendieck Problem
So far :
• Unique Games Barrier
• Semidefinite Programming
technique (Maxcut example)
Coming Up :
• Generic Algorithm for CSPs
• Hardness Result for MaxCut.
Generic Algorithm for CSPs
Semidefinite Program for CSPs
( x1 x 2 x3 )( x2 x 3 x5 )( x 2 x3 x5 )( x 5 x4 x1 )
Variables : Constraints :
For each variable Xa For each clause P,
0 ≤μ(P,α) ≤ 1
Vectors {V(a,0) , V(a,1)} Xa = 1 V(a,0) = 0
( P , ) 1
V(a,1) = 1
SDP objective:
maximize
v2
Rounding Scheme
[Raghavendra-Steurer] v1 v3
Map vector V
V → V’ = (V∙G1 , V∙G2 , … V∙G100) v4
v2
STEP 2 : Discretization
v2
v1
•Pick an Є –net for the
100 dimensional sphere
v3
v4
• Move every vertex to the nearest
v5
point in the Є –net Constant dimensions
STEP 3 : Brute Force
•Find a solution to the new FINITE MODEL
Graph on Є –net points
instance by brute force.
HARDNESS RESULT FOR MAXCUT
The Goal
UG Hardness
HARD INSTANCE G Assuming UGC,
Suppose for an instance G, On instances with MaxCut = C,
Dimension Reduction v1 v3
Max Cut SDP:
v1
v2
of the edges)
v3
v4
v5
Project to random 1/ Є2
Constant dimensional hyperplane
dimensional space.
New SDP Value = Old SDP Value + or - Є
Making the Instance Harder
v2
v5
v3
v2
v1 v3 SDP Value = Average Squared
v5
v3
v2 Length of an Edge
v1 v3
v3
v1
v4
Transformations
v5
• Rotation does not change the
v2
v4
v5 SDP value.
v2
v4 v5
same SDP value
v1
v4 v4
Sphere Graph H :
Union of all possible rotations of G.
v5
v3
v2
v1 v3
v5
v3
v2
v1 v3 MaxCut (H) = S
v3
v1
v4
v5 MaxCut (G) ≥ S
v2
v4
v5
Pick a random rotation of G and
v2
v1
v4 v4 Thus,
v2
v1 v3
MaxCut (H) ≤ MaxCut(G)
v5
v4
SDP Value (G) = SDP Value (H)
v2
Hypercube Graph v1 v3
Output (x,y)
100 dimensional hypercube : {-1,1}100
Dichotomy of Cuts
1 1
1
1 A cut gives a function F on the
hypercube
F : {-1,1}100 -> {-1,1}
-1
-1 Dictator Cuts
-1
F(x) = xi
v2 Intuition:
v3
v5
v1 v3
Sphere graph : Uniform on all directions
v4
v2
v4
v1 v 2
v4
If a cut does not respect the axis, then it should
not distinguish between Sphere and Hypercube
graphs.
The Invariance Principle
Central Limit Theorem
H
P : sphere -> Nearly {-1,1}
F:{-1,1} -> {-1,1}
100
is the multilinear extension
is a cut far from every of F
dictator.
By Invariance Principle,
MaxCut value of F on hypercube ≈ Maxcut value of P on
Sphere graph H
Hyper Cube Graph
v2
v1 v3
[Dictatorship Test]
v5 [Bellare-Goldreich-Sudan]
v4
Graph G Completeness
Value of Dictator Cuts
= SDP Value (G)
Soundness
Cuts far from dictators
≤ MaxCut( Sphere Graph)
Hypercube = {-1,1}100 ≤ MaxCut( G)
UG Hardness
UG Hardness
Dictatorship “On instances, with
Test value C, it is NP-hard to
Completeness C
Soundness S [KKMO] output a solution of
value S, assuming UGC”
In our case,
Dichotomy Conjecture
Semidefinite Programming “Every CSP is polynomial
or UG hardness results time solvable or NP-hard”
for problems beyond CSP
Pr[ACCEPT ] = Pr[ACCEPT ] =
Completeness Soundness
A Dictatorship Test for Maxcut
A dictatorship test is a graph
G on the hypercube.
A cut gives a function F on the
hypercube
Completeness
Value of Dictator Cuts
F(x) = xi
Soundness
Hypercube = {-1,1}100 The maximum value
attained by a cut far from
a dictator
Connections
SDP Gap
Instance
SDP = 0.9
OPT = 0.7
[Khot-Vishnoi]
[This Work] For sparsest cut, max cut.
Dictatorship UG
Test Hardness
Completeness = 0.9 0.9 vs 0.7
Soundness = 0.7
[Khot-Kindler-Mossel-O’Donnell]
Total PayOff
Triangle Inequality
2-CSP over {0,..q-1}
Total PayOff
Arbitrary k-ary GCSP
DICTG
Any
Dictatorship Test
CSP Instance
on functions
G
F : {-1,1}n ->{-1,1}
Any RoundF
Function Rounding Scheme
F: {-1,1}n → {-1,1} on CSP Instances G
Maximize
1
3
| v1 v2 |2 | v2 v3 |2 | v3 v1 |2
c = SDP Value
Local Random Variables v1 , v2 , v3 = SDP Vectors
1 1 1 1 2
E A12 [( F (a1 ) F (a2 )) ] E A23 [( F (a2 ) F (a3 )) ] E A31 [( F (a3 ) F (a1 )) ]
2 2
3 4 4 4
A12,A23,A31 = Local Distributions
Completeness
Input Function is a Dictator : F(x) = x1
1 11 1 2 2 1 1 2 2 11 2 2
E A12 [(EFA12(a[(1 )a11F (aa2 ))21 )] ] E AE [( F[((aa221) Fa(31a3)))] ] E A3131[([(Fa(31a3) a11F)(a]1 )) ]
3 43 4 44 23 A23
44
Suppose (a1 ,a2) is sampled from A12 then :
E[a11 a21] = v1∙ v2 E[a112] = |v1|2 E[a212] = |v2|2
E A12 [(a1 a2 ) ] | v1 v2 |
2 2
Generate
b1 = (b11 ,b12 ,… b1R)
b2 = (b21 ,b22 ,… b2R)
b3 = (b31 ,b32 ,… b3R)
with each coordinate (b1t ,b2t ,b3t) according to global distribution B
3 4 4 4
Invariance
Suppose F is far from every dictator then since A12
and B have same first two moments,
F(a1),F(a2) has nearly same distribution as
F(b1),F(b2)
1 1
E A12 [( F (a1 ) F (a2 )) ] EB [( F (b1 ) F (b2 )) 2 ]
2
• 4 4
Substitute
vi* ∙ vj* = (1-ε) (vi ∙ vj) + ε (ui∙ uj)
Semidefinite Linear program over the
Program inner products of vectors
ALGORITHM
MAX 4-SAT
MAX DI CUT
MAX 3-SAT
MAX CUT
MAX 2-SAT
MAX Horn SAT
MAX k-CUT
0 1
MAX k-CSP
Unique Games
MAX 3-CSP
MAX 3-AND
MAX 3-MAJ
MAX E2 LIN3
MAX 3 DI-CUT
MAX 4-SAT
MAX DI CUT
MAX 3-SAT
MAX CUT
MAX 2-SAT
MAX Horn SAT
MAX k-CUT
0 1
MAX k-CSP
Unique Games
MAX 3-CSP
MAX 3-AND
MAX 3-MAJ
MAX E2 LIN3
MAX 3 DI-CUT
MAX 4-SAT
MAX DI CUT
MAX 3-SAT
MAX CUT
MAX 2-SAT
MAX Horn SAT
MAX k-CUT
0 1
Approximability of CSPs
ALGORITHMS Unique Games
[Charikar-Makarychev-Makarychev 06] MAX 3-CSP
[Goemans-Williamson] MAX CUT
[Charikar-Wirth]
[Lewin-Livnat-Zwick] MAX 2-SAT
[Charikar-Makarychev-Makarychev 07] MAX k-CSP
[Hast] MAX 3-SAT
[Charikar-Makarychev-Makarychev 07] MAX DI CUT
[Frieze-Jerrum]
MAX 4-SAT
[Karloff-Zwick]
[Zwick SODA 98] MAX k-CUT
[Zwick STOC 98] MAX Horn SAT
[Zwick 99]
[Halperin-Zwick 01] MAX 3 DI-CUT
[Goemans-Williamson 01] MAX E2 LIN3
[Goemans 01] MAX 3-AND
[Feige-Goemans] MAX 3-MAJ
[Matuura-Matsui]
0 1