Title :
K-means clustering based on self-adaptive weight
Author :
Yuzhu Zhang ; Hualin Shi ; Damin Zhang
Author_Institution :
Inst. of Comput. Sci. & Inf., Guizhou Univ., Guiyang, China
Abstract :
Traditional K-means algorithm is sensitive to the initial cluster centers. It is easily influenced by the outlier and the uneven distribution of the samples to fall into the local optimum states. The K-means algorithm treats all features fairly and sets weights of all features equally when evaluating dissimilarity, which easily makes the cluster fall into the trap of dimension. Due to the deficiency of k-means algorithm, this paper proposed an improved K-means clustering algorithm based on self-adaptive weights. The new method uses the Gaussian distance ration in cluster approximation to set the sample weights. It computed the weights for every data according to the current clustering state and did not rely on the initial clustering center and initial weights any more. The new method selects the weights more scientifically and gets more accurate clustering results. The experimental results show that the algorithm accuracy and stability are significantly higher than the traditional K-means algorithm.
Keywords :
Gaussian processes; approximation theory; data mining; pattern clustering; Gaussian distance ration; cluster approximation; clustering state; dissimilarity evaluation; initial clustering center; k-means clustering algorithm; self-adaptive weights; clustering; gaussian distance; k-means algorithm; self-adaptive weights; weighting;
Conference_Titel :
Computer Science and Network Technology (ICCSNT), 2012 2nd International Conference on
Conference_Location :
Changchun
Print_ISBN :
978-1-4673-2963-7
DOI :
10.1109/ICCSNT.2012.6526212