Title :
Consensus Clusterings
Author :
Nguyen, Nam ; Caruana, Rich
Author_Institution :
Cornell Univ., Ithaca
Abstract :
In this paper we address the problem of combining multiple clusterings without access to the underlying features of the data. This process is known in the literature as clustering ensembles, clustering aggregation, or consensus clustering. Consensus clustering yields a stable and robust final clustering that is in agreement with multiple clusterings. We find that an iterative EM-like method is remarkably effective for this problem. We present an iterative algorithm and its variations for finding clustering consensus. An extensive empirical study compares our proposed algorithms with eleven other consensus clustering methods on four data sets using three different clustering performance metrics. The experimental results show that the new ensemble clustering methods produce clusterings that are as good as, and often better than, these other methods.
Keywords :
data mining; iterative methods; pattern clustering; clustering aggregation; clustering ensembles; consensus clusterings; iterative algorithm; iterative method; multiple clusterings; Clustering algorithms; Clustering methods; Computer science; Data mining; Iterative algorithms; Iterative methods; Measurement; Motion pictures; Partitioning algorithms; Robustness;
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
Print_ISBN :
978-0-7695-3018-5
DOI :
10.1109/ICDM.2007.73