DocumentCode
3180219
Title
Evaluating scalable fuzzy clustering
Author
Gu, Yuhua ; Hall, Lawrence O. ; Goldgof, Dmitry B.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
fYear
2010
fDate
10-13 Oct. 2010
Firstpage
873
Lastpage
880
Abstract
Clustering large data has the problem of not having all the data fit in the memory at one time. It is a challenge to apply fuzzy clustering algorithms to get a partition in a timely manner. In this paper, we compare the online fuzzy clustering and single pass fuzzy clustering algorithms, which can be used to cluster very large data sets which might be treated as streaming data, with fuzzy c-means. We introduce more meaningful partition comparison measurements based on cluster center location instead of using the difference in Rm value. We obtained results on several large volumes of magnetic resonance images which indicate that the online FCM algorithm produces partitions which are very close to what you could get if you clustered all the data at one time. We also show online FCM outperforms single pass FCM and it can process streaming data as it comes without degradation in most cases.
Keywords
fuzzy set theory; pattern clustering; data clustering; fuzzy c-means; scalable fuzzy clustering; Clustering algorithms; Clustering; fuzzy c means; large data sets; single pass; streaming;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1062-922X
Print_ISBN
978-1-4244-6586-6
Type
conf
DOI
10.1109/ICSMC.2010.5641870
Filename
5641870
Link To Document