Title :
Comparing Fuzzy, Probabilistic, and Possibilistic Partitions Using the Earth Mover’s Distance
Author :
Anderson, Derek T. ; Zare, Alina ; Price, Steven
Author_Institution :
Dept. of Electr. & Comput. Eng., Mississippi State Univ., Starkville, MS, USA
Abstract :
A number of noteworthy techniques have been put forth recently in different research fields for comparing clusterings. Herein, we introduce a new method for comparing soft (fuzzy, probabilistic, and possibilistic) partitions based on the earth mover´s distance (EMD) and the ordered weighted average (OWA). The proposed method is a metric, depending on the ground distance, for all but possibilistic partitions. It is extremely flexible due to its EMD formulation, OWA aggregation, and abstract concept of ground distance. In theory, our method is agnostic to the type (uncertainty) of soft partition, clustering algorithm, and distance measure used in the clustering algorithm(s), and it is applicable to the clustering of both object and relational data. Validation is performed theoretically, experimentally, as well as in terms of computational complexity. Emphasis is placed on the set of possibilistic partitions, specifically noise and coincident clusters, which are important cases that have received little to no attention to date in the comparing clustering literature. Improvements are reported in terms of metric properties and computational complexity over existing extended concordance/discordance (e.g., soft Rand and Jaccard) approaches and improved design and robustness in comparison with existing transportation problem-based approaches.
Keywords :
computational complexity; fuzzy set theory; pattern clustering; possibility theory; probability; EMD formulation; OWA aggregation; clustering algorithm; coincident clusters; computational complexity; distance measure; earth mover distance; fuzzy partitions; ground distance; noise clusters; object data clustering; ordered weighted average; possibilistic partitions; probabilistic partitions; relational data clustering; soft partitions; Clustering algorithms; Histograms; Measurement; Open wireless architecture; Partitioning algorithms; Probabilistic logic; Vectors; Cluster validity; comparing partitions; earth mover’s distance (EMD); ordered weighted average (OWA); possibilistic clustering;
Journal_Title :
Fuzzy Systems, IEEE Transactions on
DOI :
10.1109/TFUZZ.2012.2230181