DocumentCode :
610335
Title :
Coupled clustering ensemble: Incorporating coupling relationships both between base clusterings and objects
Author :
Can Wang ; Zhong She ; Longbing Cao
Author_Institution :
Adv. Analytics Inst., Univ. of Technol., Sydney, NSW, Australia
fYear :
2013
fDate :
8-12 April 2013
Firstpage :
374
Lastpage :
385
Abstract :
Clustering ensemble is a powerful approach for improving the accuracy and stability of individual (base) clustering algorithms. Most of the existing clustering ensemble methods obtain the final solutions by assuming that base clusterings perform independently with one another and all objects are independent too. However, in real-world data sources, objects are more or less associated in terms of certain coupling relationships. Base clusterings trained on the source data are complementary to one another since each of them may only capture some specific rather than full picture of the data. In this paper, we discuss the problem of explicating the dependency between base clusterings and between objects in clustering ensembles, and propose a framework for coupled clustering ensembles (CCE). CCE not only considers but also integrates the coupling relationships between base clusterings and between objects. Specifically, we involve both the intra-coupling within one base clustering (i.e., cluster label frequency distribution) and the inter-coupling between different base clusterings (i.e., cluster label co-occurrence dependency). Furthermore, we engage both the intra-coupling between two objects in terms of the base clustering aggregation and the inter-coupling among other objects in terms of neighborhood relationship. This is the first work which explicitly addresses the dependency between base clusterings and between objects, verified by the application of such couplings in three types of consensus functions: clustering-based, object-based and cluster-based. Substantial experiments on synthetic and UCI data sets demonstrate that the CCE framework can effectively capture the interactions embedded in base clusterings and objects with higher clustering accuracy and stability compared to several state-of-the-art techniques, which is also supported by statistical analysis.
Keywords :
pattern clustering; statistical analysis; CCE; base clusterings; cluster-based consensus functions; clustering-based consensus functions; coupled clustering ensembles; coupling relationships; neighborhood relationship; object-based consensus functions; statistical analysis; Accuracy; Clustering algorithms; Couplings; Equations; Mathematical model; Rocks; Stability analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
ISSN :
1063-6382
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2013.6544840
Filename :
6544840
Link To Document :
بازگشت