Exponential Time Approximations for
Coloring 3-Colorable Graphs

Venkatesan Guruswami Simons Institute for the Theory of Computing and the Dept. of EECS, Univ. of California, Berkeley, Berkeley, CA 94720. [email protected]. Research supported in part by a Simons Investigator award and NSF grant CCF-2211972.    Rhea Jain Dept. of Computer Science, Univ. of Illinois, Urbana-Champaign, Urbana, IL 61801. [email protected].
Abstract

The problem of efficiently coloring 3333-colorable graphs with few colors has received much attention on both the algorithmic and inapproximability fronts. We consider exponential time approximations, in which given a parameter r𝑟ritalic_r, we aim to develop an r𝑟ritalic_r-approximation algorithm with the best possible runtime, providing a tradeoff between runtime and approximation ratio. In this vein, an algorithm to O(nϵ)𝑂superscript𝑛italic-ϵO(n^{\epsilon})italic_O ( italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT )-color a 3-colorable graphs in time 2Θ(n12ϵlog(n))superscript2Θsuperscript𝑛12italic-ϵ𝑛2^{\Theta(n^{1-2\epsilon}\log(n))}2 start_POSTSUPERSCRIPT roman_Θ ( italic_n start_POSTSUPERSCRIPT 1 - 2 italic_ϵ end_POSTSUPERSCRIPT roman_log ( italic_n ) ) end_POSTSUPERSCRIPT is given in (Atserias and Dalmau, SODA 2022.)

We build on tools developed in (Bansal et al., Algorithmic, 2019) to obtain an algorithm to color 3333-colorable graphs with O(r)𝑂𝑟O(r)italic_O ( italic_r ) colors in exp(O~(nlog11/2rr3))~𝑂𝑛superscript112𝑟superscript𝑟3\exp\left(\tilde{O}\left(\frac{n\log^{11/2}r}{r^{3}}\right)\right)roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 11 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) time, asymptotically improving upon the bound given by Atserias and Dalmau.

1 Introduction

Graph Coloring is a classic and fundamental problem in theoretical computer science with applications in scheduling, register allocation, and more. A graph is said to be r𝑟ritalic_r-colorable if there is an assignment of r𝑟ritalic_r colors to vertices such that no two adjacent vertices share the same color. The question of whether a given graph can be colored using r𝑟ritalic_r colors for r3𝑟3r\geq 3italic_r ≥ 3 was one of Karp’s NP-complete problems [19]. For any graph G𝐺Gitalic_G, the minimum r𝑟ritalic_r such that G𝐺Gitalic_G is r𝑟ritalic_r-colorable is called the chromatic number of G𝐺Gitalic_G; it is known that the chromatic number is hard to approximate to a factor n1o(1)superscript𝑛1𝑜1n^{1-o(1)}italic_n start_POSTSUPERSCRIPT 1 - italic_o ( 1 ) end_POSTSUPERSCRIPT [15, 23].

It is natural to ask if one can obtain a valid coloring of a graph with few colors if its chromatic number is known. The first polynomial time algorithm for this problem was given by Wigderson [25], who obtained an O(n11r1)𝑂superscript𝑛11𝑟1O(n^{1-\frac{1}{r-1}})italic_O ( italic_n start_POSTSUPERSCRIPT 1 - divide start_ARG 1 end_ARG start_ARG italic_r - 1 end_ARG end_POSTSUPERSCRIPT )-coloring for r𝑟ritalic_r-colorable graphs. In this paper we focus on the special case where G𝐺Gitalic_G is 3333-colorable. There are several known polynomial time algorithms for this setting. Blum obtained a polynomial improvement over [25] to O~(n3/8)~𝑂superscript𝑛38\tilde{O}(n^{3/8})over~ start_ARG italic_O end_ARG ( italic_n start_POSTSUPERSCRIPT 3 / 8 end_POSTSUPERSCRIPT ) colors [7] when the input graph is 3333-colorable. This was improved to O(n1/4)𝑂superscript𝑛14O(n^{1/4})italic_O ( italic_n start_POSTSUPERSCRIPT 1 / 4 end_POSTSUPERSCRIPT ) using semi-definite programming methods by Karger, Motwani, and Sudan [18]. A long line of subsequent work culminated in the the development of an algorithm using O(n0.19996)𝑂superscript𝑛0.19996O(n^{0.19996})italic_O ( italic_n start_POSTSUPERSCRIPT 0.19996 end_POSTSUPERSCRIPT ) colors [20], very recently improved to O(n0.19747)𝑂superscript𝑛0.19747O(n^{0.19747})italic_O ( italic_n start_POSTSUPERSCRIPT 0.19747 end_POSTSUPERSCRIPT ) [21]. On the hardness of approximation side, it is known to be NP-hard to color a 3-colorable graph using 4 colors [22, 16, 10], and until recently, no better lower bound was known. Barto et al. recently showed that it is NP-hard to color an r𝑟ritalic_r-colorable graph using 2r12𝑟12r-12 italic_r - 1 colors; in particular, it is NP-hard to 5-color a 3-colorable graph [5]. Furthermore, there is some evidence based on variants of the Unique Games Conjecture that coloring an r𝑟ritalic_r-colorable graph with O(1)r𝑂1𝑟O(1)\cdot ritalic_O ( 1 ) ⋅ italic_r colors is hard for any r3𝑟3r\geq 3italic_r ≥ 3 [13, 17].

The hardness of approximating graph coloring in polynomial time has motivated the design of algorithms that provide a more fine-grained trade-off between running time and approximation factors; see [8, 9, 12] for some such work in graph coloring, as well as the closely related Independent Set problem. As a simple example, one can obtain an r𝑟ritalic_r-coloring of a 3-colorable graph G𝐺Gitalic_G in time O(exp(O(n/r)))superscript𝑂𝑂𝑛𝑟O^{*}(\exp(O(n/r)))italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( italic_O ( italic_n / italic_r ) ) ) as follows: partition G𝐺Gitalic_G into r/3𝑟3r/3italic_r / 3 blocks of size 3n/r3𝑛𝑟3n/r3 italic_n / italic_r each and use an exact 3-coloring exponential-time algorithm on each block. In general, this idea is surprisingly close to the best we can do: any r𝑟ritalic_r-approximation to Graph Coloring requires at least exp(n1o(1)/r1+o(1))superscript𝑛1𝑜1superscript𝑟1𝑜1\exp(n^{1-o(1)}/r^{1+o(1)})roman_exp ( italic_n start_POSTSUPERSCRIPT 1 - italic_o ( 1 ) end_POSTSUPERSCRIPT / italic_r start_POSTSUPERSCRIPT 1 + italic_o ( 1 ) end_POSTSUPERSCRIPT ) time, assuming ETH [24].

Recently, Bansal et. al. [3] obtained an r𝑟ritalic_r-approximation for the Graph Coloring problem that runs in O(exp(O~(n/(rlogr)+rlog2r)))superscript𝑂~𝑂𝑛𝑟𝑟𝑟superscript2𝑟O^{*}\left(\exp\left(\tilde{O}\left(n/(r\log r)+r\log^{2}r\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( italic_n / ( italic_r roman_log italic_r ) + italic_r roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r ) ) ) time. As a subroutine in this algorithm, they use and develop an r𝑟ritalic_r-approximation for the Independent Set problem that runs in
O(exp(O~(n/(rlog2r)+rlog2r)))superscript𝑂~𝑂𝑛𝑟superscript2𝑟𝑟superscript2𝑟O^{*}\left(\exp\left(\tilde{O}\left(n/(r\log^{2}r)+r\log^{2}r\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( italic_n / ( italic_r roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r ) + italic_r roman_log start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_r ) ) ) time. In this work we improve upon the approximation algorithms given by Bansal et. al. [3] for the setting in which the input graph G𝐺Gitalic_G is known to be 3-colorable.

1.1 Promise Constraint Satisfaction Problems and Width-based Algorithms

Graph colorability is an example of a larger class of problems known as constraint satisfaction problems (CSPs). The input to a CSP is a set of variables X𝑋Xitalic_X over a domain D𝐷Ditalic_D, along with a set of constraints. Each constraint specifies, for some subset of variables, a relation that it must takes values in. The goal is to find a valid assignment of values from the domain to each variable that satisfies all constraints. Equivalently, one can model CSPs as a problem of finding homomorphism between relational structures [14]. That is, we can think of the input as two structures: 𝕏=(X,S1,,Sn)𝕏𝑋subscript𝑆1subscript𝑆𝑛\mathbb{X}=(X,S_{1},\dots,S_{n})blackboard_X = ( italic_X , italic_S start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ), where SiXrisubscript𝑆𝑖superscript𝑋subscript𝑟𝑖S_{i}\subseteq X^{r_{i}}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_X start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT, 𝔻=(D,R1,,Rn)𝔻𝐷subscript𝑅1subscript𝑅𝑛\mathbb{D}=(D,R_{1},\dots,R_{n})blackboard_D = ( italic_D , italic_R start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_R start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ), where RiDrisubscript𝑅𝑖superscript𝐷subscript𝑟𝑖R_{i}\subseteq D^{r_{i}}italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⊆ italic_D start_POSTSUPERSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUPERSCRIPT. The goal is to find a function f:XD:𝑓𝑋𝐷f:X\to Ditalic_f : italic_X → italic_D such that f((x1),,f(xri))Ri𝑓subscript𝑥1𝑓subscript𝑥subscript𝑟𝑖subscript𝑅𝑖f((x_{1}),\dots,f(x_{r_{i}}))\in R_{i}italic_f ( ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , italic_f ( italic_x start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ) ∈ italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for all (x1,,xri)Sisubscript𝑥1subscript𝑥subscript𝑟𝑖subscript𝑆𝑖(x_{1},\dots,x_{r_{i}})\in S_{i}( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∈ italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. In the case of r𝑟ritalic_r-graph colorability, the variables are the vertices of the graph, the sets Sisubscript𝑆𝑖S_{i}italic_S start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the edges, the domain is the set of colors {1,,r}1𝑟\{1,\dots,r\}{ 1 , … , italic_r }, and the constraints Risubscript𝑅𝑖R_{i}italic_R start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are all tuples of two different colors.

Brakensiek and Guruswami [11], building on work in [2], recently proposed a generalization of this class of problems, known as promise constraint satisfaction problems (PCSPs); these are CSPs in which we are guaranteed that the input satisfies an even stronger set of constraints. Formally, we are given three relational structures 𝕏,𝔸,𝔹𝕏𝔸𝔹\mathbb{X},\mathbb{A},\mathbb{B}blackboard_X , blackboard_A , blackboard_B where there exists a homomorphism from 𝔹𝔹\mathbb{B}blackboard_B to 𝔸𝔸\mathbb{A}blackboard_A. Given the promise that there exists some homomorphism from 𝕏𝕏\mathbb{X}blackboard_X to 𝔹𝔹\mathbb{B}blackboard_B, the goal is to find one from 𝕏𝕏\mathbb{X}blackboard_X to 𝔸𝔸\mathbb{A}blackboard_A. It is easy to see that the problem of r𝑟ritalic_r-coloring a 3-colorable graph is an example of a promise CSP, with 𝔸𝔸\mathbb{A}blackboard_A corresponding to r𝑟ritalic_r-colorability and 𝔹𝔹\mathbb{B}blackboard_B to 3-colorability.

Fixed-template CSPs are CSPs in which the structure 𝔸𝔸\mathbb{A}blackboard_A is fixed and the input specifies 𝕏𝕏\mathbb{X}blackboard_X. One of the few known techniques for solving fixed-template CSPs in polynomial time is local consistency checking. This technique considers k𝑘kitalic_k-strategies, which are sets of partial solutions on subsets of variables of size at most k𝑘kitalic_k. These sets must be closed under restrictions and all partial solutions of subsets of size less than k𝑘kitalic_k must be extendable to all supersets up to size k𝑘kitalic_k. Clearly, if there exists a homomorphism from 𝕏𝕏\mathbb{X}blackboard_X to 𝔸𝔸\mathbb{A}blackboard_A, then there exists a k𝑘kitalic_k-strategy for all k|X|𝑘𝑋k\leq|X|italic_k ≤ | italic_X |. A CSP is said to have width k𝑘kitalic_k if the converse holds: a non-empty k𝑘kitalic_k-strategy implies a homomorphism from 𝕏𝕏\mathbb{X}blackboard_X to 𝔸𝔸\mathbb{A}blackboard_A. For any fixed k𝑘kitalic_k, this gives a simple polynomial time algorithm to solve the CSP: find all partial solutions on subsets of size at most k𝑘kitalic_k and remove the ones that don’t satisfy the closure and extension properties until we either have a non-empty k𝑘kitalic_k-strategy or an empty set.

One can define a similar notion in the context of PCSPs [5]. A PCSP has width k𝑘kitalic_k if a non-empty k𝑘kitalic_k-strategy on (𝕏,𝔹)𝕏𝔹(\mathbb{X},\mathbb{B})( blackboard_X , blackboard_B ) implies a homomorphism from 𝕏𝕏\mathbb{X}blackboard_X to 𝔸𝔸\mathbb{A}blackboard_A. Recently, Atserias and Dalmau [1] showed that given an input graph G𝐺Gitalic_G and parameter ϵ(0,1/2)italic-ϵ012\epsilon\in(0,1/2)italic_ϵ ∈ ( 0 , 1 / 2 ), the problem of distinguishing whether G𝐺Gitalic_G is 3-colorable from the case where G𝐺Gitalic_G is not O(nϵ)𝑂superscript𝑛italic-ϵO(n^{\epsilon})italic_O ( italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT )-colorable has width between n12ϵsuperscript𝑛12italic-ϵn^{1-2\epsilon}italic_n start_POSTSUPERSCRIPT 1 - 2 italic_ϵ end_POSTSUPERSCRIPT and n13ϵsuperscript𝑛13italic-ϵn^{1-3\epsilon}italic_n start_POSTSUPERSCRIPT 1 - 3 italic_ϵ end_POSTSUPERSCRIPT. This gives an algorithm to color 3-colorable graphs with O(nϵ)𝑂superscript𝑛italic-ϵO(n^{\epsilon})italic_O ( italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT ) colors in time 2Θ(n12ϵlogn)superscript2Θsuperscript𝑛12italic-ϵ𝑛2^{\Theta(n^{1-2\epsilon}\log n)}2 start_POSTSUPERSCRIPT roman_Θ ( italic_n start_POSTSUPERSCRIPT 1 - 2 italic_ϵ end_POSTSUPERSCRIPT roman_log italic_n ) end_POSTSUPERSCRIPT. Improving this runtime is posed as an open question in [1].

1.2 Definitions and Notation

For a given graph G𝐺Gitalic_G, we let V(G)𝑉𝐺V(G)italic_V ( italic_G ) denote its set of vertices, E(G)𝐸𝐺E(G)italic_E ( italic_G ) denote the set of edges, and n:=|V(G)|assign𝑛𝑉𝐺n:=|V(G)|italic_n := | italic_V ( italic_G ) |. For any vV(G)𝑣𝑉𝐺v\in V(G)italic_v ∈ italic_V ( italic_G ), we call the neighborhood of v𝑣vitalic_v (denoted NG(v)subscript𝑁𝐺𝑣N_{G}(v)italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v )) the set of all uV(G)𝑢𝑉𝐺u\in V(G)italic_u ∈ italic_V ( italic_G ) such that {u,v}E(G)𝑢𝑣𝐸𝐺\{u,v\}\in E(G){ italic_u , italic_v } ∈ italic_E ( italic_G ). We let dG(v)=|NG(v)|subscript𝑑𝐺𝑣subscript𝑁𝐺𝑣d_{G}(v)=|N_{G}(v)|italic_d start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) = | italic_N start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT ( italic_v ) | denote the degree of v𝑣vitalic_v. We write Osuperscript𝑂O^{*}italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT to suppress polynomial factors in n𝑛nitalic_n, and O~~𝑂\tilde{O}over~ start_ARG italic_O end_ARG to suppress polyloglog(r)polyloglog𝑟\textnormal{polyloglog}(r)polyloglog ( italic_r ) factors (where r𝑟ritalic_r is the approximation ratio). The analogous definitions hold for Ω~~Ω\tilde{\Omega}over~ start_ARG roman_Ω end_ARG.

Maximum Independent Set: We say a set SV(G)𝑆𝑉𝐺S\subseteq V(G)italic_S ⊆ italic_V ( italic_G ) is independent if for all u,vS𝑢𝑣𝑆u,v\in Sitalic_u , italic_v ∈ italic_S, {u,v}E𝑢𝑣𝐸\{u,v\}\notin E{ italic_u , italic_v } ∉ italic_E. We let α(G)𝛼𝐺\alpha(G)italic_α ( italic_G ) denote the size of the maximum cardinality independent set of G𝐺Gitalic_G.

Graph Coloring: We say that a function c:V(G)[r]:𝑐𝑉𝐺delimited-[]𝑟c:V(G)\to[r]italic_c : italic_V ( italic_G ) → [ italic_r ] is a r𝑟ritalic_r-coloring if c(u)c(v){u,v}E(G)𝑐𝑢𝑐𝑣for-all𝑢𝑣𝐸𝐺c(u)\neq c(v)~{}\forall\{u,v\}\in E(G)italic_c ( italic_u ) ≠ italic_c ( italic_v ) ∀ { italic_u , italic_v } ∈ italic_E ( italic_G ). We say G𝐺Gitalic_G is r𝑟ritalic_r-colorable if a valid r𝑟ritalic_r-coloring of G𝐺Gitalic_G exists. We let χ(G)𝜒𝐺\chi(G)italic_χ ( italic_G ) denote the chromatic number of G𝐺Gitalic_G, i.e. the minimum integer r𝑟ritalic_r such that G𝐺Gitalic_G is r𝑟ritalic_r-colorable.

1.3 Our Result and Techniques

Our main contribution is an improved appoximation algorithm for coloring 3-colorable graphs.

Theorem 1.1.

Given a 3-colorable graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) and a parameter r>0𝑟0r>0italic_r > 0, there exists an algorithm that, with high probability, outputs a valid coloring of G𝐺Gitalic_G using at most O(r)𝑂𝑟O(r)italic_O ( italic_r ) colors in time O(exp(O~(nlog11/2rr3)))superscript𝑂~𝑂𝑛superscript112𝑟superscript𝑟3O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{11/2}r}{r^{3}}\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 11 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ).

For any fixed real ϵ(0,1/3)italic-ϵ013\epsilon\in(0,1/3)italic_ϵ ∈ ( 0 , 1 / 3 ), setting r𝑟ritalic_r to be nϵsuperscript𝑛italic-ϵn^{\epsilon}italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT, we obtain a coloring of G𝐺Gitalic_G using O(nϵ)𝑂superscript𝑛italic-ϵO(n^{\epsilon})italic_O ( italic_n start_POSTSUPERSCRIPT italic_ϵ end_POSTSUPERSCRIPT ) colors in time O(exp(O(n13ϵ)))superscript𝑂𝑂superscript𝑛13italic-ϵO^{*}\left(\exp\left(O(n^{1-3\epsilon})\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( italic_O ( italic_n start_POSTSUPERSCRIPT 1 - 3 italic_ϵ end_POSTSUPERSCRIPT ) ) ). This asymptotically improves upon the width-based algorithm given by Atserias and Dalmau [1]. Furthermore, up to polynomial factors in n𝑛nitalic_n, this matches the width-based lower bound given in [1]. Whether our result can be captured by a width-based argument remains a direction for future research.

Techniques: We closely follow the algorithm given by Bansal et al. [3]. We first consider the independent set problem. The approach in [3] is to branch on high degree vertices. For each such vertex, the algorithm chooses to include it with some small probability, in which case all of its neighbors are removed, else it removes the high degree vertex and recurses on the remaining graph. The probability of choosing each vertex is chosen in a way to ensure the number of leaves of the branching process remains bounded. Once the maximum degree of the graph is small, the algorithm terminates by applying a known approximation for for graphs with bounded degree [4]. We follow from a similar algorithm, replacing the base case with an Ω~(d1/3)~Ωsuperscript𝑑13\tilde{\Omega}(d^{1/3})over~ start_ARG roman_Ω end_ARG ( italic_d start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT )-approximation for Independent Set on 3-colorable graphs. To then color the graph G𝐺Gitalic_G, the algorithm in [3] repeatedly finds large independent sets and colors each independent set with its own color. Once the number of remaining vertices is small enough, they use a brute force inclusion-exclusion based algorithm for coloring the remaining vertices in time O(2|V|)superscript𝑂superscript2𝑉O^{*}(2^{|V|})italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( 2 start_POSTSUPERSCRIPT | italic_V | end_POSTSUPERSCRIPT ) [6]. We follow the same approach, using our improved approximation for Independent Set.

2 Improved Approximation for Coloring 3-Colorable Graphs

2.1 Independent Set

In this section, we prove the following lemma providing an approximation algorithm for finding an independent set in a 3-colorable graph:

Lemma 2.1.

Given a 3-colorable graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) and a parameter r>0𝑟subscriptabsent0r\in\mathbbm{Z}_{>0}italic_r ∈ blackboard_Z start_POSTSUBSCRIPT > 0 end_POSTSUBSCRIPT, there exists an algorithm that, with probability at least 1/2121/21 / 2, outputs an independent set of size at least α(G)/r𝛼𝐺𝑟\alpha(G)/ritalic_α ( italic_G ) / italic_r in time O(exp(O~(nlog5/2rr3)))superscript𝑂~𝑂𝑛superscript52𝑟superscript𝑟3O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{5/2}r}{r^{3}}\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ).

We use the following lemma from [3].

Lemma 2.2 ([3]).

Let 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) be an approximation algorithm that outputs an independent set of Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT of size α(G)/r𝛼superscript𝐺𝑟\alpha(G^{\prime})/ritalic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) / italic_r. Suppose 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) runs in time T𝒜(n,r)subscript𝑇𝒜superscript𝑛𝑟T_{\mathcal{A}}(n^{\prime},r)italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_r ) (where n=|V(G)|superscript𝑛𝑉superscript𝐺n^{\prime}=|V(G^{\prime})|italic_n start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = | italic_V ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) |) when G𝐺Gitalic_G has maximum degree d(r)𝑑𝑟d(r)italic_d ( italic_r ), where d(r)2r𝑑𝑟2𝑟d(r)\geq 2ritalic_d ( italic_r ) ≥ 2 italic_r. Then, there is an algorithm that given any graph G𝐺Gitalic_G, outputs an independent set of G𝐺Gitalic_G of expected size α(G)/r𝛼𝐺𝑟\alpha(G)/ritalic_α ( italic_G ) / italic_r in expected time O(exp(nd(r)log(4d(r)/r))T𝒜(n,r))superscript𝑂𝑛𝑑𝑟4𝑑𝑟𝑟subscript𝑇𝒜𝑛𝑟O^{*}\left(\exp\left(\frac{n}{d(r)}\log(4d(r)/r)\right)T_{\mathcal{A}}(n,r)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( divide start_ARG italic_n end_ARG start_ARG italic_d ( italic_r ) end_ARG roman_log ( 4 italic_d ( italic_r ) / italic_r ) ) italic_T start_POSTSUBSCRIPT caligraphic_A end_POSTSUBSCRIPT ( italic_n , italic_r ) ).

We note that the algorithm of [3] only calls 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) on graphs Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT obtained by deleting vertices and edges from G𝐺Gitalic_G. Therefore, if G𝐺Gitalic_G is 3-colorable, we can assume Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is as well. The seminal work of Karger, Motwani, and Sudan [18] provides a polynomial time algorithm to color 3-colorable graphs using O(d1/3log1/2dlogn)𝑂superscript𝑑13superscript12𝑑𝑛O(d^{1/3}\log^{1/2}d\log n)italic_O ( italic_d start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT roman_log start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT italic_d roman_log italic_n ) colors, where d𝑑ditalic_d is the maximum degree of the input graph. We will use the following lemma from their analysis:

Lemma 2.3 ([18]).

Given a 3333-colorable graph G𝐺Gitalic_G, there exists a polynomial time algorithm to find an independent set of size Ω(n/(d1/3logd))Ω𝑛superscript𝑑13𝑑\Omega(n/(d^{1/3}\sqrt{\log d}))roman_Ω ( italic_n / ( italic_d start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_d end_ARG ) ), where d𝑑ditalic_d is the maximum degree of the input graph.

Combining these two results, we obtain the following corollary:

Corollary 2.4.

Given a 3333-colorable graph G𝐺Gitalic_G, there exists a polynomial time algorithm that outputs an independent set of expected size α(G)/r𝛼𝐺𝑟\alpha(G)/ritalic_α ( italic_G ) / italic_r in expected time O(exp(O~(nlog5/2rr3)))superscript𝑂~𝑂𝑛superscript52𝑟superscript𝑟3O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{5/2}r}{r^{3}}\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ).

Proof.

We refer to the algorithm given by Lemma 2.3 as 𝒜𝒜\mathcal{A}caligraphic_A. By Lemma 2.3, for any 3333-colorable graph Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with maximum degree d=r3log3/2r𝑑superscript𝑟3superscript32𝑟d=\frac{r^{3}}{\log^{3/2}r}italic_d = divide start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG roman_log start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_r end_ARG, 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) outputs an independent set of size

Ω(nd1/3logd)=Ω(nlog(r)rlog(r/logr)).Ω𝑛superscript𝑑13𝑑Ω𝑛𝑟𝑟𝑟𝑟\Omega\left(\frac{n}{d^{1/3}\sqrt{\log d}}\right)=\Omega\left(\frac{n\sqrt{% \log(r)}}{r\sqrt{\log(r/\log r)}}\right).roman_Ω ( divide start_ARG italic_n end_ARG start_ARG italic_d start_POSTSUPERSCRIPT 1 / 3 end_POSTSUPERSCRIPT square-root start_ARG roman_log italic_d end_ARG end_ARG ) = roman_Ω ( divide start_ARG italic_n square-root start_ARG roman_log ( italic_r ) end_ARG end_ARG start_ARG italic_r square-root start_ARG roman_log ( italic_r / roman_log italic_r ) end_ARG end_ARG ) .

Note that since Gsuperscript𝐺G^{\prime}italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is 3333-colorable, α(G)n/3𝛼superscript𝐺𝑛3\alpha(G^{\prime})\geq n/3italic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≥ italic_n / 3. Thus we can choose d(r)=O~(r3/log3/2r)𝑑𝑟~𝑂superscript𝑟3superscript32𝑟d(r)=\tilde{O}(r^{3}/\log^{3/2}r)italic_d ( italic_r ) = over~ start_ARG italic_O end_ARG ( italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT / roman_log start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_r ) so that 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) outputs an independent set of size α(G)/r𝛼superscript𝐺𝑟\alpha(G^{\prime})/ritalic_α ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) / italic_r in polynomial time (recall here that O~~𝑂\tilde{O}over~ start_ARG italic_O end_ARG suppresses polyloglog(r)polyloglog𝑟\textnormal{polyloglog}(r)polyloglog ( italic_r ) factors).

We then apply Lemma 2.2 on the input G𝐺Gitalic_G with 𝒜(G)𝒜superscript𝐺\mathcal{A}(G^{\prime})caligraphic_A ( italic_G start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) as defined above to obtain an independent set of expected size α(G)/r𝛼𝐺𝑟\alpha(G)/ritalic_α ( italic_G ) / italic_r in expected time

O(exp(nlog3/2rr3log(4r3rlog3/2r))poly(n))=O(exp(O~(nlog5/2rr3))).superscript𝑂𝑛superscript32𝑟superscript𝑟34superscript𝑟3𝑟superscript32𝑟poly𝑛superscript𝑂~𝑂𝑛superscript52𝑟superscript𝑟3\displaystyle O^{*}\left(\exp\left(\frac{n\log^{3/2}r}{r^{3}}\log\left(\frac{4% r^{3}}{r\log^{3/2}r}\right)\right)\operatorname{poly}(n)\right)=O^{*}\left(% \exp\left(\tilde{O}\left(\frac{n\log^{5/2}r}{r^{3}}\right)\right)\right).italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG roman_log ( divide start_ARG 4 italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG start_ARG italic_r roman_log start_POSTSUPERSCRIPT 3 / 2 end_POSTSUPERSCRIPT italic_r end_ARG ) ) roman_poly ( italic_n ) ) = italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ) .

To complete the proof of Lemma 2.1, note that Corollary 2.4 gives the desired runtime and approximation ratio in expectation. Thus it suffices to run multiple iterations of this algorithm to obtain our desired probability of failure.

Proof of Lemma 2.1.

Let G𝐺Gitalic_G be a given 3-colorable graph. Let (G,r)𝐺𝑟\mathcal{B}(G,r)caligraphic_B ( italic_G , italic_r ) denote the algorithm given by Corollary 2.4, and let T(n,r)=O(exp(O~(nlog5/2rr3)))superscript𝑇𝑛𝑟superscript𝑂~𝑂𝑛superscript52𝑟superscript𝑟3T^{\prime}(n,r)=O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{5/2}r}{r^{3}}% \right)\right)\right)italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_n , italic_r ) = italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ) denote the expected runtime of B(G,r)𝐵𝐺𝑟B(G,r)italic_B ( italic_G , italic_r ). We run B(G,r/2)𝐵𝐺𝑟2B(G,r/2)italic_B ( italic_G , italic_r / 2 ) r𝑟ritalic_r times and output the maximum independent set given. If the total runtime exceeds 5rT(n,r/2)5𝑟superscript𝑇𝑛𝑟25rT^{\prime}(n,r/2)5 italic_r italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_n , italic_r / 2 ), we terminate the algorithm.

By construction, this algorithm runs in time O(rT(n,r/2))=O(exp(O~(nlogrr3)))𝑂𝑟superscript𝑇𝑛𝑟2superscript𝑂~𝑂𝑛𝑟superscript𝑟3O(rT^{\prime}(n,r/2))=O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log r}{r^{3}% }\right)\right)\right)italic_O ( italic_r italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_n , italic_r / 2 ) ) = italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ). By Markov’s inequality, the probability that the total runtime exceeds 5rT(n,r/2)5𝑟superscript𝑇𝑛𝑟25rT^{\prime}(n,r/2)5 italic_r italic_T start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_n , italic_r / 2 ) is at most 1515\frac{1}{5}divide start_ARG 1 end_ARG start_ARG 5 end_ARG. Thus it remains to bound the error probability given that we successfully complete r𝑟ritalic_r iterations of B(G,r/2)𝐵𝐺𝑟2B(G,r/2)italic_B ( italic_G , italic_r / 2 ).

Fix a single run of B(G,r/2)𝐵𝐺𝑟2B(G,r/2)italic_B ( italic_G , italic_r / 2 ), and let S𝑆Sitalic_S be the resulting independent set. To bound the probability that S𝑆Sitalic_S is small, note that 𝐏𝐫[|S|α(G)/r]=𝐏𝐫[α(G)|S|α(G)(11/r)]𝐏𝐫delimited-[]𝑆𝛼𝐺𝑟𝐏𝐫delimited-[]𝛼𝐺𝑆𝛼𝐺11𝑟\mathop{\bf Pr\/}[|S|\leq\alpha(G)/r]=\mathop{\bf Pr\/}[\alpha(G)-|S|\geq% \alpha(G)(1-1/r)]start_BIGOP bold_Pr end_BIGOP [ | italic_S | ≤ italic_α ( italic_G ) / italic_r ] = start_BIGOP bold_Pr end_BIGOP [ italic_α ( italic_G ) - | italic_S | ≥ italic_α ( italic_G ) ( 1 - 1 / italic_r ) ]. Since |S|α(G)𝑆𝛼𝐺|S|\leq\alpha(G)| italic_S | ≤ italic_α ( italic_G ), we can apply Markov’s inequality to obtain

𝐏𝐫[α(G)|S|α(G)(11/r)]𝐄[α(G)|S|]α(G)(11/r)=α(G)(12/r)α(G)(11/r)=11r1.𝐏𝐫delimited-[]𝛼𝐺𝑆𝛼𝐺11𝑟𝐄delimited-[]𝛼𝐺𝑆𝛼𝐺11𝑟𝛼𝐺12𝑟𝛼𝐺11𝑟11𝑟1\displaystyle\mathop{\bf Pr\/}[\alpha(G)-|S|\geq\alpha(G)(1-1/r)]\leq\frac{% \mathop{\bf E\/}[\alpha(G)-|S|]}{\alpha(G)(1-1/r)}=\frac{\alpha(G)(1-2/r)}{% \alpha(G)(1-1/r)}=1-\frac{1}{r-1}.start_BIGOP bold_Pr end_BIGOP [ italic_α ( italic_G ) - | italic_S | ≥ italic_α ( italic_G ) ( 1 - 1 / italic_r ) ] ≤ divide start_ARG start_BIGOP bold_E end_BIGOP [ italic_α ( italic_G ) - | italic_S | ] end_ARG start_ARG italic_α ( italic_G ) ( 1 - 1 / italic_r ) end_ARG = divide start_ARG italic_α ( italic_G ) ( 1 - 2 / italic_r ) end_ARG start_ARG italic_α ( italic_G ) ( 1 - 1 / italic_r ) end_ARG = 1 - divide start_ARG 1 end_ARG start_ARG italic_r - 1 end_ARG .

Thus the probability that none of the r𝑟ritalic_r runs of B(G,r/2)𝐵𝐺𝑟2B(G,r/2)italic_B ( italic_G , italic_r / 2 ) give an independent set of size at least α(G)/r𝛼𝐺𝑟\alpha(G)/ritalic_α ( italic_G ) / italic_r is at most (11/(r1))r1/esuperscript11𝑟1𝑟1𝑒(1-1/(r-1))^{r}\leq 1/e( 1 - 1 / ( italic_r - 1 ) ) start_POSTSUPERSCRIPT italic_r end_POSTSUPERSCRIPT ≤ 1 / italic_e. Combining with the runtime error, we get a total probability of failure of at most 15+45e<1/21545𝑒12\frac{1}{5}+\frac{4}{5e}<1/2divide start_ARG 1 end_ARG start_ARG 5 end_ARG + divide start_ARG 4 end_ARG start_ARG 5 italic_e end_ARG < 1 / 2. ∎

2.2 Graph Coloring

In this section, we refer to the algorithm given by Lemma 2.1 as IS(G,r)IS𝐺𝑟\textnormal{IS}(G,r)IS ( italic_G , italic_r ) for consistency with the notation used in [3]. To prove Theorem 1.1, we follow the same general approach outlined by Bansal et. al. in [3] with IS(G,r)IS𝐺𝑟\textnormal{IS}(G,r)IS ( italic_G , italic_r ) as a subroutine. For completeness, we describe the full algorithm and analysis below. Let t=nr3𝑡𝑛superscript𝑟3t=\frac{n}{r^{3}}italic_t = divide start_ARG italic_n end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG, r=r/log(n/t)=rlogr3superscript𝑟𝑟𝑛𝑡𝑟superscript𝑟3r^{\prime}=r/\log(n/t)=\frac{r}{\log r^{3}}italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_r / roman_log ( italic_n / italic_t ) = divide start_ARG italic_r end_ARG start_ARG roman_log italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG.

Algorithm 1 r𝑟ritalic_r-Approximate Graph Coloring
1:c0𝑐0c\leftarrow 0italic_c ← 0
2:while |V|t𝑉𝑡|V|\geq t| italic_V | ≥ italic_t do
3:     cc+1𝑐𝑐1c\leftarrow c+1italic_c ← italic_c + 1
4:     Ccsubscript𝐶𝑐C_{c}\leftarrow\emptysetitalic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ← ∅
5:     for i=1,,n𝑖1𝑛i=1,\dots,nitalic_i = 1 , … , italic_n do
6:         Ccargmax(|Cc|,|IS(G[V],r)|)subscript𝐶𝑐argmaxsubscript𝐶𝑐IS𝐺delimited-[]𝑉superscript𝑟C_{c}\leftarrow\textnormal{argmax}(|C_{c}|,|\textnormal{IS}(G[V],r^{\prime})|)italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ← argmax ( | italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT | , | IS ( italic_G [ italic_V ] , italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) | )
7:     end for
8:     VVCc𝑉𝑉subscript𝐶𝑐V\leftarrow V\setminus C_{c}italic_V ← italic_V ∖ italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT
9:end while
10:(Cc+1,Cc+2,Cc+3)subscript𝐶𝑐1subscript𝐶𝑐2subscript𝐶𝑐3absent(C_{c+1},C_{c+2},C_{c+3})\leftarrow( italic_C start_POSTSUBSCRIPT italic_c + 1 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT italic_c + 2 end_POSTSUBSCRIPT , italic_C start_POSTSUBSCRIPT italic_c + 3 end_POSTSUBSCRIPT ) ← brute force optimum coloring of G[V]𝐺delimited-[]𝑉G[V]italic_G [ italic_V ]
11:return (C1,,Cc+3)subscript𝐶1subscript𝐶𝑐3(C_{1},\dots,C_{c+3})( italic_C start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_C start_POSTSUBSCRIPT italic_c + 3 end_POSTSUBSCRIPT )
Claim 2.5.

For a fixed iteration c𝑐citalic_c of Algorithm 1, Cc|V|3rsubscript𝐶𝑐𝑉3superscript𝑟C_{c}\geq\frac{|V|}{3r^{\prime}}italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT ≥ divide start_ARG | italic_V | end_ARG start_ARG 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG with probability at least 1(1/2)n1superscript12𝑛1-(1/2)^{n}1 - ( 1 / 2 ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT.

Proof.

At iteration c𝑐citalic_c, G[V]𝐺delimited-[]𝑉G[V]italic_G [ italic_V ] must have an independent set of size at least |V|/3𝑉3|V|/3| italic_V | / 3, since G𝐺Gitalic_G is 3-colorable and removing vertices cannot increase the chromatic number. By Lemma 2.1, for each i[n]𝑖delimited-[]𝑛i\in[n]italic_i ∈ [ italic_n ], the independent set Ccsubscript𝐶𝑐C_{c}italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT given by line 6, |Cc|α(G[V])/r|V|3rsubscript𝐶𝑐𝛼𝐺delimited-[]𝑉superscript𝑟𝑉3superscript𝑟|C_{c}|\geq\alpha(G[V])/r^{\prime}\geq\frac{|V|}{3r^{\prime}}| italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT | ≥ italic_α ( italic_G [ italic_V ] ) / italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ≥ divide start_ARG | italic_V | end_ARG start_ARG 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG with probability at least 1212\frac{1}{2}divide start_ARG 1 end_ARG start_ARG 2 end_ARG. Thus 𝐏𝐫[|Cc|<|V|/(3r)](1/2)n𝐏𝐫delimited-[]subscript𝐶𝑐𝑉3superscript𝑟superscript12𝑛\mathop{\bf Pr\/}[|C_{c}|<|V|/(3r^{\prime})]\leq(1/2)^{n}start_BIGOP bold_Pr end_BIGOP [ | italic_C start_POSTSUBSCRIPT italic_c end_POSTSUBSCRIPT | < | italic_V | / ( 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] ≤ ( 1 / 2 ) start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT as desired. ∎

Claim 2.6.

Algorithm 1 gives a valid coloring of G𝐺Gitalic_G using at most O(r)𝑂𝑟O(r)italic_O ( italic_r ) colors with high probability.

Proof.

Validity of the coloring follows is clear by construction, since we choose a new color for each independent set. It remains to bound the number of colors used.

Note that cn𝑐𝑛c\leq nitalic_c ≤ italic_n at all times, since |V|𝑉|V|| italic_V | reduces by at least one at each iteration. Thus by Claim 2.5, |V|𝑉|V|| italic_V | reduces by a factor of (11/(3r))113superscript𝑟(1-1/(3r^{\prime}))( 1 - 1 / ( 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) at every iteration with probability at least 1n/(2n)1𝑛superscript2𝑛1-n/(2^{n})1 - italic_n / ( 2 start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ). After \ellroman_ℓ iterations,

|V|n(113r)nexp(3r)=nexp(ln(n/t)3r)=n(nt)/3r.𝑉𝑛superscript113superscript𝑟𝑛3superscript𝑟𝑛𝑛𝑡3𝑟𝑛superscript𝑛𝑡3𝑟|V|\leq n\left(1-\frac{1}{3r^{\prime}}\right)^{\ell}\leq n\cdot\exp\left(-% \frac{\ell}{3r^{\prime}}\right)=n\cdot\exp\left(-\frac{\ell\ln(n/t)}{3r}\right% )=n\cdot\left(\frac{n}{t}\right)^{-\ell/3r}.| italic_V | ≤ italic_n ( 1 - divide start_ARG 1 end_ARG start_ARG 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG ) start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT ≤ italic_n ⋅ roman_exp ( - divide start_ARG roman_ℓ end_ARG start_ARG 3 italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG ) = italic_n ⋅ roman_exp ( - divide start_ARG roman_ℓ roman_ln ( italic_n / italic_t ) end_ARG start_ARG 3 italic_r end_ARG ) = italic_n ⋅ ( divide start_ARG italic_n end_ARG start_ARG italic_t end_ARG ) start_POSTSUPERSCRIPT - roman_ℓ / 3 italic_r end_POSTSUPERSCRIPT .

Therefore, when =3r3𝑟\ell=3rroman_ℓ = 3 italic_r, |V|t𝑉𝑡|V|\leq t| italic_V | ≤ italic_t, so Algorithm 1 runs for at most 3r+13𝑟13r+13 italic_r + 1 iterations. Since G𝐺Gitalic_G is 3-colorable, the final optimum coloring of the remaining vertices on line 10 uses at most 3 colors. Thus the total number of colors used is at most 3r+4=O(r)3𝑟4𝑂𝑟3r+4=O(r)3 italic_r + 4 = italic_O ( italic_r ). ∎

Claim 2.7.

Algorithm 1 runs in time O(exp(O~(nlog11/2rr3)))superscript𝑂~𝑂𝑛superscript112𝑟superscript𝑟3O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{11/2}r}{r^{3}}\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 11 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ).

Proof.

We first bound the runtime of the while loop in lines 2-9. By Lemma 2.1, IS(G[V],r)IS𝐺delimited-[]𝑉superscript𝑟\textnormal{IS}(G[V],r^{\prime})IS ( italic_G [ italic_V ] , italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) runs in time O(exp(O~(nlog5/2rr3)))superscript𝑂~𝑂𝑛superscript52superscript𝑟superscript𝑟3O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{5/2}r^{\prime}}{r^{\prime 3}}% \right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT ′ 3 end_POSTSUPERSCRIPT end_ARG ) ) ). By the analysis in the proof of Claim 2.6, the total number of iterations is at most O(r)𝑂𝑟O(r)italic_O ( italic_r ), and in each iteration we run IS(G[V],r)IS𝐺delimited-[]𝑉superscript𝑟\textnormal{IS}(G[V],r^{\prime})IS ( italic_G [ italic_V ] , italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) r𝑟ritalic_r times. This the loop runs in time

O(r2exp(O~(nlog5/2rr3)))=O(exp(O~(nlog5/2(r/log(r3))r3/log3(r3))))superscript𝑂superscript𝑟2~𝑂𝑛superscript52superscript𝑟superscript𝑟3superscript𝑂~𝑂𝑛superscript52𝑟superscript𝑟3superscript𝑟3superscript3superscript𝑟3\displaystyle O^{*}\left(r^{2}\exp\left(\tilde{O}\left(\frac{n\log^{5/2}r^{% \prime}}{r^{\prime 3}}\right)\right)\right)=O^{*}\left(\exp\left(\tilde{O}% \left(\frac{n\log^{5/2}(r/\log(r^{3}))}{r^{3}/\log^{3}(r^{3})}\right)\right)\right)italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT italic_r start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_ARG start_ARG italic_r start_POSTSUPERSCRIPT ′ 3 end_POSTSUPERSCRIPT end_ARG ) ) ) = italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 5 / 2 end_POSTSUPERSCRIPT ( italic_r / roman_log ( italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ) end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT / roman_log start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ( italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG ) ) )
=\displaystyle== O(exp(O~(nlog11/2rr3))).superscript𝑂~𝑂𝑛superscript112𝑟superscript𝑟3\displaystyle O^{*}\left(\exp\left(\tilde{O}\left(\frac{n\log^{11/2}r}{r^{3}}% \right)\right)\right).italic_O start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( roman_exp ( over~ start_ARG italic_O end_ARG ( divide start_ARG italic_n roman_log start_POSTSUPERSCRIPT 11 / 2 end_POSTSUPERSCRIPT italic_r end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ) .

Using a naive brute force algorithm, the final step in line 10 runs in time O(3t)=O(exp(O(nr3)))𝑂superscript3𝑡𝑂𝑂𝑛superscript𝑟3O\left(3^{t}\right)=O\left(\exp\left(O\left(\frac{n}{r^{3}}\right)\right)\right)italic_O ( 3 start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT ) = italic_O ( roman_exp ( italic_O ( divide start_ARG italic_n end_ARG start_ARG italic_r start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG ) ) ), giving us our desired time bound. ∎

References

  • [1] Albert Atserias and Víctor Dalmau. Promise constraint satisfaction and width. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 1129–1153. SIAM, 2022.
  • [2] Per Austrin, Venkatesan Guruswami, and Johan Håstad. (2+ϵ2italic-ϵ2+\epsilon2 + italic_ϵ)-SAT is NP-hard. SIAM Journal on Computing, 46(5):1554–1573, 2017. arXiv:https://doi.org/10.1137/15M1006507, doi:10.1137/15M1006507.
  • [3] Nikhil Bansal, Parinya Chalermsook, Bundit Laekhanukit, Danupon Nanongkai, and Jesper Nederlof. New tools and connections for exponential-time approximation. Algorithmica, 81(10):3993–4009, 2019.
  • [4] Nikhil Bansal, Anupam Gupta, and Guru Guruganesh. On the Lovász Theta function for independent sets in sparse graphs. In Proceedings of the Forty-Seventh Annual ACM Symposium on Theory of Computing, STOC ’15, page 193–200, New York, NY, USA, 2015. Association for Computing Machinery. doi:10.1145/2746539.2746607.
  • [5] Libor Barto, Jakub Bulín, Andrei Krokhin, and Jakub Opršal. Algebraic approach to promise constraint satisfaction. J. ACM, 68(4), jul 2021. doi:10.1145/3457606.
  • [6] Andreas Björklund, Thore Husfeldt, and Mikko Koivisto. Set partitioning via inclusion-exclusion. SIAM J. Comput., 39(2):546–563, jul 2009. doi:10.1137/070683933.
  • [7] Avrim Blum. New approximation algorithms for graph coloring. J. ACM, 41(3):470–516, may 1994. doi:10.1145/176584.176586.
  • [8] Édouard Bonnet and Vangelis Th Paschos. Sparsification and subexponential approximation. Acta Informatica, 55:1–15, 2018.
  • [9] Nicolas Bourgeois, Bruno Escoffier, and Vangelis Th Paschos. Approximation of max independent set, min vertex cover and related problems by moderately exponential algorithms. Discrete Applied Mathematics, 159(17):1954–1970, 2011.
  • [10] Joshua Brakensiek and Venkatesan Guruswami. New Hardness Results for Graph and Hypergraph Colorings. In Ran Raz, editor, 31st Conference on Computational Complexity (CCC 2016), volume 50 of Leibniz International Proceedings in Informatics (LIPIcs), pages 14:1–14:27, Dagstuhl, Germany, 2016. Schloss Dagstuhl – Leibniz-Zentrum für Informatik. URL: https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.CCC.2016.14, doi:10.4230/LIPIcs.CCC.2016.14.
  • [11] Joshua Brakensiek and Venkatesan Guruswami. Promise constraint satisfaction: Algebraic structure and a symmetric Boolean dichotomy. SIAM Journal on Computing, 50(6):1663–1700, 2021. arXiv:https://doi.org/10.1137/19M128212X, doi:10.1137/19M128212X.
  • [12] Marek Cygan, Lukasz Kowalik, Marcin Pilipczuk, and Mateusz Wykurz. Exponential-time approximation of hard problems, 2008. arXiv:0810.4934.
  • [13] Irit Dinur, Elchanan Mossel, and Oded Regev. Conditional hardness for approximate coloring. SIAM Journal on Computing, 39(3):843–873, 2009. arXiv:https://doi.org/10.1137/07068062X, doi:10.1137/07068062X.
  • [14] Tomás Feder and Moshe Y. Vardi. The computational structure of monotone monadic snp and constraint satisfaction: A study through datalog and group theory. SIAM Journal on Computing, 28(1):57–104, 1998. arXiv:https://doi.org/10.1137/S0097539794266766, doi:10.1137/S0097539794266766.
  • [15] Uriel Feige and Joe Kilian. Zero knowledge and the chromatic number. Journal of Computer and System Sciences, 57(2):187–199, 1998. URL: https://www.sciencedirect.com/science/article/pii/S0022000098915872, doi:https://doi.org/10.1006/jcss.1998.1587.
  • [16] Venkatesan Guruswami and Sanjeev Khanna. On the hardness of 4-coloring a 3-colorable graph. volume 18, pages 30–40, 2004. doi:10.1137/S0895480100376794.
  • [17] Venkatesan Guruswami and Sai Sandeep. d𝑑ditalic_d-to-1 hardness of coloring 3-colorable graphs with o(1)𝑜1o(1)italic_o ( 1 ) colors. In Artur Czumaj, Anuj Dawar, and Emanuela Merelli, editors, 47th International Colloquium on Automata, Languages, and Programming (ICALP), volume 168 of LIPIcs, pages 62:1–62:12. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020. URL: https://doi.org/10.4230/LIPIcs.ICALP.2020.62, doi:10.4230/LIPICS.ICALP.2020.62.
  • [18] David R. Karger, Rajeev Motwani, and Madhu Sudan. Approximate graph coloring by semidefinite programming. J. ACM, 45(2):246–265, 1998. doi:10.1145/274787.274791.
  • [19] Richard M. Karp. Reducibility among Combinatorial Problems, pages 85–103. Springer US, Boston, MA, 1972. doi:10.1007/978-1-4684-2001-2_9.
  • [20] Ken-Ichi Kawarabayashi and Mikkel Thorup. Coloring 3-colorable graphs with less than n1/5superscript𝑛15n^{1/5}italic_n start_POSTSUPERSCRIPT 1 / 5 end_POSTSUPERSCRIPT colors. J. ACM, 64(1), mar 2017. doi:10.1145/3001582.
  • [21] Ken-ichi Kawarabayashi, Mikkel Thorup, and Hirotaka Yoneda. Better coloring of 3-colorable graphs. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, page 331–339, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3618260.3649768.
  • [22] Sanjeev Khanna, Nathan Linial, and Shmuel Safra. On the hardness of approximating the chromatic number. Combinatorica, 20(3):393–415, March 2000. doi:10.1007/s004930070013.
  • [23] S. Khot. Improved inapproximability results for maxclique, chromatic number and approximate graph coloring. In Proceedings 42nd IEEE Symposium on Foundations of Computer Science, pages 600–609, 2001. doi:10.1109/SFCS.2001.959936.
  • [24] Bundit Laekhanukit. Inapproximability of combinatorial problems in subexponential-time. Phd thesis, McGill University, 2014.
  • [25] Avi Wigderson. Improving the performance guarantee for approximate graph coloring. J. ACM, 30(4):729–735, oct 1983. doi:10.1145/2157.2158.