Title :
Mining high-quality clusters in pattern-based clustering
Author :
Qian Ma ; Jingfeng Guo
Author_Institution :
Modern Educ. Technol. Manage. Center, HengShui Coll., Hengshui, China
Abstract :
Pattern-based clustering, which capture the similarity of the patterns exhibited by objects in a subset of dimensions, has broad applications in DNA microarray data analysis, customer segmentation, e-business data analysis, etc. However, pattern-based clustering often returns a large number of highly-overlapping clusters, which makes it hard for users to identify interesting patterns from the huge mining results. Moreover, there lacks a general measurement to evaluate the quality of Clusters which pattern-based clustering obtained. In this paper, we discuss factors which cause highly-overlapping, make error analysis and pattern weighting, and propose qScore as a key evaluation parameters on quality of Clusters. A algorithm which based on qScore is presented to solve the problem of high-overlapping and get better quality clustering results.
Keywords :
data mining; pattern clustering; error analysis; high-quality cluster mining; highly-overlapping clusters; pattern similarity; pattern weighting; pattern-based clustering; qScore; Algorithm design and analysis; Analytical models; Approximation algorithms; Clustering algorithms; Data analysis; Data mining; Error analysis; Pattern similarity; Pattern-based Clustering; error analysis; highly-Overlapping clusters; qScore;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2011 Eighth International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-180-9
DOI :
10.1109/FSKD.2011.6019708