• DocumentCode
    3239631
  • Title

    Information cut and information forces for clustering

  • Author

    Jenssen, Robert ; Principe, Jose C. ; Eltoft, Torbjørn

  • Author_Institution
    Comput. NeuroEngineering Lab., Florida Univ., Gainesville, FL, USA
  • fYear
    2003
  • fDate
    17-19 Sept. 2003
  • Firstpage
    459
  • Lastpage
    468
  • Abstract
    We define an information-theoretic divergence measure between probability density functions (pdfs) that has a deep connection to the cut in graph-theory. This connection is revealed when the pdfs are estimated by the Parzen method with a Gaussian kernel. We refer to our divergence measure as the information cut. The information cut provides us with a theoretically sound criterion for cluster evaluation. In this paper we show that it can be used to merge clusters. The initial clusters are obtained based on the related concept of information forces. We create directed trees by selecting the predecessor of a node (pattern) according to the direction of the information force acting on the pattern. Each directed tree corresponds to a cluster, hence enabling us to obtain an initial partitioning of the data set. Subsequently, we utilize the information cut as a cluster evaluation function to merge clusters until the predefined number of clusters is reached. We demonstrate the performance of our novel information-theoretic clustering method when applied to both artificially created data and real data, with encouraging results.
  • Keywords
    data analysis; graph theory; information theory; pattern clustering; Gaussian kernel; Parzen method; cluster evaluation function; divergence measure; graph theory; information cut; information forces; information-theoretic clustering method; probability density functions; Clustering algorithms; Clustering methods; Cost function; Data structures; Density measurement; Entropy; Kernel; Laboratories; Neural engineering; Probability density function;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks for Signal Processing, 2003. NNSP'03. 2003 IEEE 13th Workshop on
  • ISSN
    1089-3555
  • Print_ISBN
    0-7803-8177-7
  • Type

    conf

  • DOI
    10.1109/NNSP.2003.1318045
  • Filename
    1318045