DocumentCode :
1854691
Title :
K-means clustering with multiresolution peak detection
Author :
Yu, Guanshan ; Soh, Leen-Kiat ; Bond, Alan
Author_Institution :
Dept. of Comput. Sci. & Eng., Nebraska-Lincoln Univ., Lincoln, NE
fYear :
2005
fDate :
22-25 May 2005
Lastpage :
6
Abstract :
Clustering is a practical data mining approach of pattern detection. Because of the sensitivity of initial conditions, k-means clustering often suffers from low clustering performance. We present a procedure to refine initial conditions of k-means clustering by analyzing density distributions of a data set before estimating the number of clusters k necessary for the data set, as well as the positions of the initial centroids of the clusters. We demonstrate that this approach indeed improves the accuracy and performance of k-means clustering measured by average intra to inter-clustering error ratio. This method is applied to the virtual ecology project to design a virtual blue jay system
Keywords :
data mining; pattern clustering; K-means clustering; data mining; density distribution; intra-to-interclustering error ratio; multiresolution peak detection; pattern detection; virtual blue jay system; virtual ecology; Biological system modeling; Biology; Bonding; Computer science; Data engineering; Data mining; Environmental factors; Frequency; Machine learning; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electro Information Technology, 2005 IEEE International Conference on
Conference_Location :
Lincoln, NE
Print_ISBN :
0-7803-9232-9
Type :
conf
DOI :
10.1109/EIT.2005.1626978
Filename :
1626978
Link To Document :
بازگشت