• DocumentCode
    2779014
  • Title

    Automatic estimation total number of cluster using a hybrid test-and-generate and K-means algorithm

  • Author

    Mahmuddin, M. ; Yusof, Y.

  • Author_Institution
    Coll. of Arts & Sci., Univ. Utara Malaysia, Sintok, Malaysia
  • fYear
    2010
  • fDate
    5-8 Dec. 2010
  • Firstpage
    593
  • Lastpage
    596
  • Abstract
    K-mean algorithm requires total number cluster, k beforehand in order the algorithm operates correctly. This pre-requisite value is needed to ensure the algorithm works on the tested data. In this paper, a test-and-generate approach is applied to estimate total number present in a data. A hybrid Bees Algorithm and cluster validity index are used for this purpose. The modified Bees algorithm is used to find near-optimal cluster centres (centroids) whereas cluster validity index is employed to examine `goodness´ of the generated clusters. A series of experiments using some benchmarking data sets are undertaken to evaluate effectiveness of the proposed approach. A promising results show that the proposed approach is capable to estimate total number of cluster in a data.
  • Keywords
    pattern clustering; hybrid bees algorithm; k-means algorithm; test and generate algorithm; total cluster number estimation; Clustering algorithms; Educational institutions; Indexes; Machine learning; Machine learning algorithms; Noise measurement; Sun; Cluster validity index; Clustering; K-means; Total number of cluster;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Applications and Industrial Electronics (ICCAIE), 2010 International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-9054-7
  • Type

    conf

  • DOI
    10.1109/ICCAIE.2010.5735150
  • Filename
    5735150