• DocumentCode
    1772356
  • Title

    An adaptive method to determine the number of clusters in clustering process

  • Author

    Huan Doan ; Dinh Thuan Nguyen

  • Author_Institution
    HCM, Univ. of Inf. Technol., Ho Chi Minh City, Vietnam
  • fYear
    2014
  • fDate
    3-5 June 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A difficult problem of most clustering algorithms is how to specify the appropriate number of clusters. This paper proposes an adaptive method of selecting of number of clusters in clustering process by making coefficients indicated the appropriate number of clusters. The intra-cluster coefficient reflects intra distortion of cluster through maximum distance and a mean distance of cluster´s extremely marginal objects. The inter-cluster coefficient reflects distance among clusters. It is ratio between closest distance from this cluster´s centre to an extremely marginal object of other cluster and mean distance from this cluster´s centre to all of extremely marginal object of other cluster respectively. A new coefficient that indicates the appropriate number of clusters is build from the intra-cluster coefficient and inter-cluster coefficient. The looking for extremely marginal objects and the new coefficient are integrated in a weighted FCM algorithm and it is calculated adaptively while the weighted FCM is processing. The weighted FCM algorithm integrated new coefficient is called FCM++. We experiment with FCM++ on some data sets of UCI: Iris, Wine, Soybean-small and show encouraging results.
  • Keywords
    fuzzy set theory; pattern clustering; FCM++; Iris data set; Soybean-small data set; TICI data set; Wine data set; adaptive method; cluster intra distortion; cluster maximum distance; cluster mean distance; cluster number determination; clustering process; fuzzy c-means; intercluster coefficient; intracluster coefficient; weighted FCM algorithm; Buildings; Clustering algorithms; Computational complexity; Educational institutions; Iris; Machine learning algorithms; Market research; clustering algorithm; determine the number of clusters; fuzzy c-means;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Sciences (ICCOINS), 2014 International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4799-4391-3
  • Type

    conf

  • DOI
    10.1109/ICCOINS.2014.6868373
  • Filename
    6868373