Title :
Fuzzy clustering: Determining the number of clusters
Author :
Rezankova, H. ; Husek, D.
Author_Institution :
Dept. of Stat. & Probability, Univ. of Econ., Prague, Czech Republic
Abstract :
In this study we analyze behavior of two types of coefficients for determining the suitable number of clusters obtained when fuzzy cluster analysis is applied. First one is Dunn´s coefficient which contains membership degrees in its computational formula; second one is the average silhouette width, used primarily for evaluating hard clustering. There have already been attempts to compare different coefficients for determining the clustering quality or number of clusters respectively. Unfortunately coefficients for evaluating hard clustering and for fuzzy clustering were studied separately only. We tested coefficients efficiency when clustering both data set consisting of generated objects with the known number of clusters and real data sets with unknown number of clusters. The analysis showed the limitations of these two coefficients especially for the cases when clusters are really fuzzy.
Keywords :
fuzzy set theory; pattern clustering; Dunn coefficient; clustering quality determination; computational formula; fuzzy cluster analysis; hard clustering; real data sets; silhouette width; Clustering algorithms; Entropy; Indexes; Security; Social network services; Standards; Vectors; Dunn´s coefficient; Fuzzy cluster analysis; average silhouette width; determining number of clusters;
Conference_Titel :
Computational Aspects of Social Networks (CASoN), 2012 Fourth International Conference on
Conference_Location :
Sao Carlos
Print_ISBN :
978-1-4673-4793-8
DOI :
10.1109/CASoN.2012.6412415