• DocumentCode
    3239380
  • Title

    On the optimality of k-means clustering

  • Author

    Dalton, Lori

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Ohio State Univ., Columbus, OH, USA
  • fYear
    2013
  • fDate
    17-19 Nov. 2013
  • Firstpage
    70
  • Lastpage
    71
  • Abstract
    Although it is typically accepted that cluster analysis is a subjective activity, without an objective framework it is impossible to understand, let alone guarantee, the predictive capacity of clustering. To address this, recent work utilizes random point process theory to develop a probabilistic theory of clustering. The theory fully parallels Bayes decision theory for classification: given a known underlying processes and specified cost function there exist Bayes clustering operators with minimum expected error. Clustering is hence transformed from a subjective activity to an objective operation. In this work, we present conditions under which the optimization function utilized in classical k-means clustering is optimal in the new Bayes clustering theory, and thus begin to understand this algorithm objectively.
  • Keywords
    Bayes methods; decision theory; optimisation; pattern clustering; Bayes clustering operators; Bayes decision theory; cost function; k-means clustering; objective operation; optimization function; predictive capacity; probabilistic theory; random point process theory; subjective activity; Clustering algorithms; Cost function; Decision theory; Indexes; Linear programming; Probabilistic logic; Silicon; Bayes decision theory; Optimal clustering; k-means clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genomic Signal Processing and Statistics (GENSIPS), 2013 IEEE International Workshop on
  • Conference_Location
    Houston, TX
  • Print_ISBN
    978-1-4799-3461-4
  • Type

    conf

  • DOI
    10.1109/GENSIPS.2013.6735934
  • Filename
    6735934