DocumentCode :
3180219
Title :
Evaluating scalable fuzzy clustering
Author :
Gu, Yuhua ; Hall, Lawrence O. ; Goldgof, Dmitry B.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
fYear :
2010
fDate :
10-13 Oct. 2010
Firstpage :
873
Lastpage :
880
Abstract :
Clustering large data has the problem of not having all the data fit in the memory at one time. It is a challenge to apply fuzzy clustering algorithms to get a partition in a timely manner. In this paper, we compare the online fuzzy clustering and single pass fuzzy clustering algorithms, which can be used to cluster very large data sets which might be treated as streaming data, with fuzzy c-means. We introduce more meaningful partition comparison measurements based on cluster center location instead of using the difference in Rm value. We obtained results on several large volumes of magnetic resonance images which indicate that the online FCM algorithm produces partitions which are very close to what you could get if you clustered all the data at one time. We also show online FCM outperforms single pass FCM and it can process streaming data as it comes without degradation in most cases.
Keywords :
fuzzy set theory; pattern clustering; data clustering; fuzzy c-means; scalable fuzzy clustering; Clustering algorithms; Clustering; fuzzy c means; large data sets; single pass; streaming;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1062-922X
Print_ISBN :
978-1-4244-6586-6
Type :
conf
DOI :
10.1109/ICSMC.2010.5641870
Filename :
5641870
Link To Document :
بازگشت