• DocumentCode
    3180219
  • Title

    Evaluating scalable fuzzy clustering

  • Author

    Gu, Yuhua ; Hall, Lawrence O. ; Goldgof, Dmitry B.

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Univ. of South Florida, Tampa, FL, USA
  • fYear
    2010
  • fDate
    10-13 Oct. 2010
  • Firstpage
    873
  • Lastpage
    880
  • Abstract
    Clustering large data has the problem of not having all the data fit in the memory at one time. It is a challenge to apply fuzzy clustering algorithms to get a partition in a timely manner. In this paper, we compare the online fuzzy clustering and single pass fuzzy clustering algorithms, which can be used to cluster very large data sets which might be treated as streaming data, with fuzzy c-means. We introduce more meaningful partition comparison measurements based on cluster center location instead of using the difference in Rm value. We obtained results on several large volumes of magnetic resonance images which indicate that the online FCM algorithm produces partitions which are very close to what you could get if you clustered all the data at one time. We also show online FCM outperforms single pass FCM and it can process streaming data as it comes without degradation in most cases.
  • Keywords
    fuzzy set theory; pattern clustering; data clustering; fuzzy c-means; scalable fuzzy clustering; Clustering algorithms; Clustering; fuzzy c means; large data sets; single pass; streaming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems Man and Cybernetics (SMC), 2010 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-6586-6
  • Type

    conf

  • DOI
    10.1109/ICSMC.2010.5641870
  • Filename
    5641870