DocumentCode :
3104659
Title :
COALA: A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity
Author :
Bae, Eric ; Bailey, James
Author_Institution :
NICTA Victoria Lab. Dept. of Comput. Sci. & Software Eng., Melbourne, Univ., Melbourne, VIC
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
53
Lastpage :
62
Abstract :
Cluster analysis has long been a fundamental task in data mining and machine learning. However, traditional clustering methods concentrate on producing a single solution, even though multiple alternative clusterings may exist. It is thus difficult for the user to validate whether the given solution is in fact appropriate, particularly for large and complex datasets. In this paper we explore the critical requirements for systematically finding a new clustering, given that an already known clustering is available and we also propose a novel algorithm, COALA, to discover this new clustering. Our approach is driven by two important factors; dissimilarity and quality. These are especially important for finding a new clustering which is highly informative about the underlying structure of data, but is at the same time distinctively different from the provided clustering. We undertake an experimental analysis and show that our method is able to outperform existing techniques, for both synthetic and real datasets.
Keywords :
pattern clustering; COALA; cluster analysis; data mining; machine learning; multiple alternative clustering; Clustering algorithms; Clustering methods; Computer science; Data mining; Laboratories; Machine learning; Merging; Proteins; Search engines; Software engineering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
ISSN :
1550-4786
Print_ISBN :
0-7695-2701-7
Type :
conf
DOI :
10.1109/ICDM.2006.37
Filename :
4053034
Link To Document :
بازگشت