DocumentCode :
126776
Title :
An improved K-Means clustering algorithm: A step forward for removal of dependency on K
Author :
Chadha, Anupama ; Kumar, Sudhakar
Author_Institution :
Fac. of Comput. Applic., MRIU, Faridabad, India
fYear :
2014
fDate :
6-8 Feb. 2014
Firstpage :
136
Lastpage :
140
Abstract :
K-Means is one of the most popular partition based clustering technique. K-means has gain popularity because of its simplicity and speed of classifying massive data rapidly and efficiently. However, the output of K-Means algorithm highly depends upon the selection of initial cluster centers because the initial cluster centers are chosen randomly. The other limitation of the algorithm is to input the required number of clusters. This requires some sort of intuitive knowledge about appropriate value of K which is sometimes difficult to predict as it requires domain knowledge. In this paper, we have proposed an algorithm based on the K-Means, but it does not require the number of clusters K as input. The time complexity and quality of the clusters produced by the proposed algorithm is compared with that of original K-Means using two different data sets.
Keywords :
computational complexity; data analysis; pattern clustering; cluster quality; data sets; domain knowledge; improved k-means clustering algorithm; initial cluster centers; massive data classification; partition based clustering technique; time complexity; Clustering algorithms; Euclidean distance; Partitioning algorithms; Proteins; K-means; accuracy; clustering; time complexity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Optimization, Reliabilty, and Information Technology (ICROIT), 2014 International Conference on
Conference_Location :
Faridabad
Print_ISBN :
978-1-4799-3958-9
Type :
conf
DOI :
10.1109/ICROIT.2014.6798312
Filename :
6798312
Link To Document :
بازگشت