Approximate kernel clustering

Khot, Subhash; Naor, Assaf


arXiv:0807.4626 (cs)

[Submitted on 29 Jul 2008 (v1), last revised 9 Dec 2008 (this version, v2)]

[Submitted on 29 Jul 2008 (v1), last revised 9 Dec 2008 (this version, v2)]

Abstract: In the kernel clustering problem we are given a large $n\times n$ positive semi-definite matrix $A=(a_{ij})$ with $\sum_{i,j=1}^n a_{ij}=0$ and a small $k\times k$ positive semi-definite matrix $B=(b_{ij})$. The goal is to find a partition $S_1,\ldots,S_k$ of $\{1,\ldots,n\}$ which maximizes the quantity $$ \sum_{i,j=1}^k \Big(\sum_{(p,q)\in S_i\times S_j}a_{pq}\Big)b_{ij}. $$ We study the computational complexity of this generic clustering problem, which originates in the theory of machine learning. We design a constant-factor polynomial-time approximation algorithm for this problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the Unique Games Conjecture (UGC). In particular, when $B$ is the $3\times 3$ identity matrix the UGC hardness threshold of this problem is exactly $\frac{16\pi}{27}$. We present and study a geometric conjecture of independent interest which we show would imply that the UGC threshold when $B$ is the $k\times k$ identity matrix is $\frac{8\pi}{9}(1-\frac{1}{k})$ for every $k\ge 3$.
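To make the objective concrete, the following is a minimal sketch that evaluates the quantity above for a given partition and, for a tiny instance only, finds the best partition by exhaustive search. It is not the approximation algorithm of the paper. The function names `clustering_value` and `brute_force_best`, the label encoding of a partition, and the toy matrix $A$ (the outer product of a mean-zero vector, which is positive semi-definite with entries summing to zero) are all assumptions made for illustration. The last line checks that the conjectured formula $\frac{8\pi}{9}(1-\frac{1}{k})$ agrees with $\frac{16\pi}{27}$ at $k=3$.

```python
import itertools
import math

import numpy as np


def clustering_value(A, B, labels):
    """Evaluate sum_{i,j} (sum_{(p,q) in S_i x S_j} a_pq) * b_ij.

    labels[p] = i encodes that element p belongs to part S_i (hypothetical encoding).
    """
    labels = np.asarray(labels)
    k = B.shape[0]
    # Indicator matrix Z with Z[p, i] = 1 exactly when p is in S_i.
    Z = np.eye(k)[labels]
    # Entry (i, j) of Z^T A Z is the sum of a_pq over (p, q) in S_i x S_j.
    block_sums = Z.T @ A @ Z
    return float(np.sum(block_sums * B))


def brute_force_best(A, B):
    """Try all k^n labelings (empty parts allowed); feasible only for tiny n."""
    n, k = A.shape[0], B.shape[0]
    best_labels = max(
        itertools.product(range(k), repeat=n),
        key=lambda lab: clustering_value(A, B, lab),
    )
    return best_labels, clustering_value(A, B, best_labels)


# Toy instance: x has mean zero, so A = x x^T is PSD and its entries sum to 0.
x = np.array([-2.0, -1.0, 1.0, 2.0])
A = np.outer(x, x)
B = np.eye(2)  # k = 2, identity matrix

value_split = clustering_value(A, B, [0, 0, 1, 1])
best_labels, best_value = brute_force_best(A, B)

# Conjectured identity-matrix threshold evaluated at k = 3.
threshold_k3 = (8 * math.pi / 9) * (1 - 1 / 3)
```

In this toy instance with $B$ the identity, the objective reduces to the sum over parts of the squared sum of the $x$ entries in each part, so splitting the negative entries from the positive ones gives $(-3)^2 + 3^2 = 18$, which the exhaustive search confirms is optimal. The search visits $k^n$ labelings and is only meant to illustrate the definition.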

Subjects:	Data Structures and Algorithms (cs.DS); Computational Complexity (cs.CC); Functional Analysis (math.FA)
Cite as:	arXiv:0807.4626 [cs.DS]
	(or arXiv:0807.4626v2 [cs.DS] for this version)
	https://doi.org/10.48550/arXiv.0807.4626

Submission history

From: Assaf Naor
[v1] Tue, 29 Jul 2008 10:40:55 UTC (33 KB)
[v2] Tue, 9 Dec 2008 20:48:32 UTC (34 KB)

