• Title of article

    Dynamic clustering of histogram data based on adaptive squared Wasserstein distances

  • Author/Authors

    Antonio Irpino, Rosanna Verde, Francisco de A.T. De Carvalho

  • Issue Information
    Periodical, serial issue for the year 2014
  • Pages
    16
  • From page
    3351
  • To page
    3366
  • Abstract
    This paper presents a Dynamic Clustering Algorithm for histogram data that uses adaptive distances to weight the variables automatically. The Dynamic Clustering Algorithm is a k-means-like algorithm for clustering a set of objects into a predefined number of classes. Histogram data are realizations of particular set-valued descriptors defined in the context of Symbolic Data Analysis. We propose using the ℓ2 Wasserstein distance for clustering histogram data, together with two novel clustering schemes based on adaptive distances. The ℓ2 Wasserstein distance allows the variability of a set of histograms to be expressed in two components: the first related to the variability of their averages, and the second related to the variability of the histograms due to differences in size and shape. The weighting step takes into account global and local adaptive distances as well as the two components of the variability of a set of histograms. To evaluate the clustering results, we extend some classic partition quality indexes to the case in which the proposed adaptive distances are used in the clustering criterion function. Examples on synthetic and real-world datasets corroborate the proposed clustering procedure.
  • Keywords
    Histogram data, Partitioning clustering method, Wasserstein distance, Symbolic data analysis, Adaptive distance
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2014
  • Record number

    2354654
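
The two-component decomposition mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm and omits the adaptive weighting step entirely. It assumes each histogram is given as a hypothetical `(edges, weights)` pair with uniform density inside each bin, and it approximates the squared ℓ2 Wasserstein distance between two one-dimensional histograms by comparing their quantile functions on a midpoint grid. The function names `quantile` and `squared_wasserstein` are illustrative only.

```python
from bisect import bisect_left
from itertools import accumulate


def quantile(edges, weights, t):
    """Quantile at level t (0 < t < 1) of a histogram, assuming
    uniform density inside each bin (an assumption of this sketch)."""
    total = sum(weights)
    # Cumulative relative frequencies at the bin edges.
    cum = [0.0] + [c / total for c in accumulate(weights)]
    # Locate the bin i with cum[i] < t <= cum[i + 1].
    i = bisect_left(cum, t) - 1
    i = min(max(i, 0), len(weights) - 1)
    span = cum[i + 1] - cum[i]
    frac = (t - cum[i]) / span if span > 0 else 0.0
    return edges[i] + frac * (edges[i + 1] - edges[i])


def squared_wasserstein(h1, h2, m=1000):
    """Approximate the squared l2 Wasserstein distance between two
    histograms and split it into a location part and a size/shape part.

    Returns (total, location, shape) with total == location + shape
    up to floating-point error.
    """
    # Midpoint grid on (0, 1) for numerical integration.
    ts = [(k + 0.5) / m for k in range(m)]
    q1 = [quantile(h1[0], h1[1], t) for t in ts]
    q2 = [quantile(h2[0], h2[1], t) for t in ts]

    # Mean squared difference between the two quantile functions.
    total = sum((a - b) ** 2 for a, b in zip(q1, q2)) / m

    # Component 1: squared difference between the averages.
    m1, m2 = sum(q1) / m, sum(q2) / m
    location = (m1 - m2) ** 2

    # Component 2: squared difference between the centered quantile
    # functions, which captures differences in size and shape.
    shape = sum(((a - m1) - (b - m2)) ** 2 for a, b in zip(q1, q2)) / m
    return total, location, shape


# Two histograms with the same shape, shifted by 3 units.
h_a = ([0.0, 1.0, 2.0], [1.0, 1.0])
h_b = ([3.0, 4.0, 5.0], [1.0, 1.0])
total, location, shape = squared_wasserstein(h_a, h_b)
```

For the pure shift in the usage lines, the whole distance falls in the location component and the shape component is essentially zero. Two histograms with different spreads would contribute to both components.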