• DocumentCode
    1595197
  • Title

    Approximate Kernel Clustering

  • Author

    Khot, Subhash ; Naor, Assaf

  • Author_Institution
    Courant Inst. of Math. Sci., New York, NY
  • fYear
    2008
  • Firstpage
    561
  • Lastpage
    570
  • Abstract
    In the kernel clustering problem we are given a large ntimesn positive semi-definite matrix A=(aij) with Sigmai,j n=1 aij=0 and a small ktimesk positivesemi-definite matrix B=bij. The goal is to find a partition S1,..Sk of {1,...n} which maximizes the quantity Sigmai,j=1 k(Sigma(i,j)isinS i timesS j). We study the computational complexity of this generic clustering problem which originates in the theory of machine learning. We design a constant factor polynomial time approximation algorithm forthis problem, answering a question posed by Song, Smola, Gretton and Borgwardt. In some cases we manage to compute the sharp approximation threshold for this problem assuming the unique games conjecture (UGC). In particular, when B is the 3times3 identity matrix the UGC hardness threshold of this problem is exactly 16pi/27. We present and study a geometricconjecture of independent interest which we show would imply thatthe UGC threshold when B is the ktimesk identity matrix is 8pi/9(1-1/k) for every kges3.
  • Keywords
    approximation theory; computational complexity; game theory; matrix algebra; pattern clustering; approximate kernel clustering; computational complexity; polynomial time approximation algorithm; semi-definite matrix; unique games conjecture; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Computational complexity; Computer science; Kernel; Machine learning; Machine learning algorithms; Polynomials; User-generated content; Approximation algorithm; clustering; inapproximability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Foundations of Computer Science, 2008. FOCS '08. IEEE 49th Annual IEEE Symposium on
  • Conference_Location
    Philadelphia, PA
  • ISSN
    0272-5428
  • Print_ISBN
    978-0-7695-3436-7
  • Type

    conf

  • DOI
    10.1109/FOCS.2008.33
  • Filename
    4690989