DocumentCode
1595197
Title
Approximate Kernel Clustering
Author
Khot, Subhash ; Naor, Assaf
Author_Institution
Courant Inst. of Math. Sci., New York, NY
fYear
2008
Firstpage
561
Lastpage
570
Abstract
In the kernel clustering problem we are given a large ntimesn positive semi-definite matrix A=(aij) with Sigmai,j n=1 aij=0 and a small ktimesk positivesemi-definite matrix B=bij. The goal is to find a partition S1,..Sk of {1,...n} which maximizes the quantity Sigmai,j=1 k(Sigma(i,j)isinS i timesS j). We study the computational complexity of this generic clustering problem which originates in the theory of machine learning. We design a constant factor polynomial time approximation algorithm forthis problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the unique games conjecture (UGC). In particular, when B is the 3times3 identity matrix the UGC hardness threshold of this problem is exactly 16pi/27. We present and study a geometricconjecture of independent interest which we show would imply thatthe UGC threshold when B is the ktimesk identity matrix is 8pi/9(1-1/k) for every kges3.
Keywords
approximation theory; computational complexity; game theory; matrix algebra; pattern clustering; approximate kernel clustering; computational complexity; polynomial time approximation algorithm; semi-definite matrix; unique games conjecture; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Computational complexity; Computer science; Kernel; Machine learning; Machine learning algorithms; Polynomials; User-generated content; Approximation algorithm; clustering; inapproximability;
fLanguage
English
Publisher
ieee
Conference_Titel
Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
Conference_Location
Philadelphia, PA
ISSN
0272-5428
Print_ISBN
978-0-7695-3436-7
Type
conf
DOI
10.1109/FOCS.2008.33
Filename
4690989
Link To Document