Title :
Performance Evaluation of Some Symmetry-Based Cluster Validity Indexes
Author :
Saha, Sriparna ; Bandyopadhyay, Sanghamitra
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., Kolkata
fDate :
7/1/2009 12:00:00 AM
Abstract :
Identifying the correct number of clusters is an important consideration in clustering, and several cluster validity indexes, primarily utilizing the Euclidean distance, have been used in the literature for this purpose. The property of symmetry is observed in most clustering solutions. In this paper, symmetry-based versions of nine cluster validity indexes, namely, the Davies-Bouldin index, Dunn index, generalized Dunn index, point symmetry (PS) index, I index, Xie-Beni index, FS index, K index, and SV index, are proposed. It is empirically established that incorporating the property of symmetry significantly improves the ability of these indexes to identify the appropriate number of clusters. A recently developed PS-based genetic clustering technique, GAPS clustering, is used as the underlying partitioning algorithm. Results on six artificially generated and five real-life datasets show that the symmetry-distance-based I index outperforms the other eight indexes.
Keywords :
data mining; genetic algorithms; pattern clustering; software performance evaluation; Davies-Bouldin index; Euclidean distance; PS-based genetic clustering technique; Xie-Beni index; cluster identification; generalized Dunn index; partitioning algorithm; point symmetry index; point-symmetry-based distance; symmetry-based cluster validity indexes; cluster validity index; point-symmetry-based distance (PS distance); symmetry property; unsupervised classification
Journal_Title :
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
DOI :
10.1109/TSMCC.2009.2013335