Title :
Collaborative Fuzzy Clustering Algorithms: Some Refinements and Design Guidelines
Author :
Coletta, Luiz F S ; Vendramin, Lucas ; Hruschka, Eduardo Raul ; Campello, Ricardo J G B ; Pedrycz, Witold
Author_Institution :
Dept. of Comput. Sci., Univ. of Sao Paulo, Sao Carlos, Brazil
fDate :
6/1/2012 12:00:00 AM
Abstract :
There are some variants of the widely used Fuzzy C-Means (FCM) algorithm that support clustering data distributed across different sites. Those methods have been studied under different names, like collaborative and parallel fuzzy clustering. In this study, we offer some augmentation of the two FCM-based clustering algorithms used to cluster distributed data by arriving at some constructive ways of determining essential parameters of the algorithms (including the number of clusters) and forming a set of systematically structured guidelines such as a selection of the specific algorithm depending on the nature of the data environment and the assumptions being made about the number of clusters. A thorough complexity analysis, including space, time, and communication aspects, is reported. A series of detailed numeric experiments is used to illustrate the main ideas discussed in the study.
Keywords :
computational complexity; data mining; fuzzy set theory; pattern clustering; FCM-based clustering algorithms; collaborative fuzzy clustering algorithms; communication aspects; distributed data clustering; fuzzy c-means algorithm; parallel fuzzy clustering; space complexity; time complexity; Algorithm design and analysis; Clustering algorithms; Collaboration; Distributed databases; Indexes; Partitioning algorithms; Prototypes; Collaborative and parallel fuzzy clustering; design and selection guidelines; distributed knowledge discovery; validity indices;
Journal_Title :
Fuzzy Systems, IEEE Transactions on
DOI :
10.1109/TFUZZ.2011.2175400