Title :
A Combination Approach to Cluster Validation Based on Statistical Quantiles
Author :
Albalate, Amparo ; Suendermann, David
Author_Institution :
Inst. of Inf. Technol., Univ. of Ulm, Ulm, Germany
Abstract :
In this paper, we analyse different techniques to detect the number of clusters in a dataset, also know as cluster validation techniques. We also propose a new algorithm based on the combination of several validation indexes to simultaneously validate several partitions of a dataset generated by different clustering techniques and object distances. The existing validation techniques as well as the combination algorithm have been tested on three data sets: a synthesized mixture of Gaussians data set, the NCI60 microarray data set, and the Iris data set. Evaluation results have shown the adequate performance of the proposed approach, even if the input validity scores fail to discover the true number of clusters.
Keywords :
pattern clustering; statistical analysis; Gaussians data set; NCI60 microarray data set; combination algorithm; dataset cluster validation technique; object distance; statistical quantile; validation index; Bioinformatics; Biology computing; Clustering algorithms; Information analysis; Information technology; Intelligent systems; Partitioning algorithms; Speech analysis; Systems biology; Testing; Cluster Validation; Quantile;
Conference_Titel :
Bioinformatics, Systems Biology and Intelligent Computing, 2009. IJCBS '09. International Joint Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3739-9
DOI :
10.1109/IJCBS.2009.116