• DocumentCode
    949197
  • Title

    A Redundancy-Based Measure of Dissimilarity among Probability Distributions for Hierarchical Clustering Criteria

  • Author

    Iwata, Kazunori ; Hayashi, Akira

  • Author_Institution
    Hiroshima City Univ., Hiroshima
  • Volume
    30
  • Issue
    1
  • fYear
    2008
  • Firstpage
    76
  • Lastpage
    88
  • Abstract
    We introduce novel dissimilarity into a probabilistic clustering task to properly measure dissimilarity among multiple clusters when each cluster is characterized by a subpopulation in the mixture model. This measure of dissimilarity is called redundancy-based dissimilarity among probability distributions. From aspects of both source coding and a statistical hypothesis test, we shed light on several of the theoretical reasons for the redundancy-based dissimilarity among probability distributions being a reasonable measure of dissimilarity among clusters. We also elucidate a principle in common for the measures of redundancy-based dissimilarity and Ward´s method in terms of hierarchical clustering criteria. Moreover, we show several related theorems that are significant for clustering tasks. In the experiments, properties of the measure of redundancy-based dissimilarity are examined in comparison with several other measures.
  • Keywords
    pattern clustering; source coding; statistical distributions; statistical testing; Ward method; hierarchical clustering criteria; probability distributions; redundancy-based dissimilarity measure; source coding; statistical hypothesis test; Ward’ clustering; dissimilarity measure; information theory; mixture model; s method; Algorithms; Artificial Intelligence; Computer Simulation; Data Interpretation, Statistical; Image Enhancement; Image Interpretation, Computer-Assisted; Imaging, Three-Dimensional; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Subtraction Technique;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2007.1160
  • Filename
    4359310