DocumentCode
2779014
Title
Automatic estimation total number of cluster using a hybrid test-and-generate and K-means algorithm
Author
Mahmuddin, M. ; Yusof, Y.
Author_Institution
Coll. of Arts & Sci., Univ. Utara Malaysia, Sintok, Malaysia
fYear
2010
fDate
5-8 Dec. 2010
Firstpage
593
Lastpage
596
Abstract
K-mean algorithm requires total number cluster, k beforehand in order the algorithm operates correctly. This pre-requisite value is needed to ensure the algorithm works on the tested data. In this paper, a test-and-generate approach is applied to estimate total number present in a data. A hybrid Bees Algorithm and cluster validity index are used for this purpose. The modified Bees algorithm is used to find near-optimal cluster centres (centroids) whereas cluster validity index is employed to examine `goodness´ of the generated clusters. A series of experiments using some benchmarking data sets are undertaken to evaluate effectiveness of the proposed approach. A promising results show that the proposed approach is capable to estimate total number of cluster in a data.
Keywords
pattern clustering; hybrid bees algorithm; k-means algorithm; test and generate algorithm; total cluster number estimation; Clustering algorithms; Educational institutions; Indexes; Machine learning; Machine learning algorithms; Noise measurement; Sun; Cluster validity index; Clustering; K-means; Total number of cluster;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Applications and Industrial Electronics (ICCAIE), 2010 International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-9054-7
Type
conf
DOI
10.1109/ICCAIE.2010.5735150
Filename
5735150
Link To Document