DocumentCode :
3166674
Title :
Consensus Clusterings
Author :
Nguyen, Nam ; Caruana, Rich
Author_Institution :
Cornell Univ., Ithaca
fYear :
2007
fDate :
28-31 Oct. 2007
Firstpage :
607
Lastpage :
612
Abstract :
In this paper we address the problem of combining multiple clusterings without access to the underlying features of the data. This process is known in the literature as clustering ensembles, clustering aggregation, or consensus clustering. Consensus clustering yields a stable and robust final clustering that is in agreement with multiple clusterings. We find that an iterative EM-like method is remarkably effective for this problem. We present an iterative algorithm and its variations for finding clustering consensus. An extensive empirical study compares our proposed algorithms with eleven other consensus clustering methods on four data sets using three different clustering performance metrics. The experimental results show that the new ensemble clustering methods produce clusterings that are as good as, and often better than, these other methods.
Keywords :
data mining; iterative methods; pattern clustering; clustering aggregation; clustering ensembles; consensus clusterings; iterative algorithm; iterative method; multiple clusterings; Clustering algorithms; Clustering methods; Computer science; Data mining; Iterative algorithms; Iterative methods; Measurement; Motion pictures; Partitioning algorithms; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on
Conference_Location :
Omaha, NE
ISSN :
1550-4786
Print_ISBN :
978-0-7695-3018-5
Type :
conf
DOI :
10.1109/ICDM.2007.73
Filename :
4470298
Link To Document :
بازگشت