DocumentCode :
1598162
Title :
Improvement of traditional k-means algorithm through the regulation of distance metric parameters
Author :
Srimani, Pradip K. ; Mahesh, Shanthi ; Bhyratae, Suhas A.
Author_Institution :
Department of Computer Science & Math´s, Bangalore University, R&D, B.U, India
fYear :
2013
Firstpage :
393
Lastpage :
398
Abstract :
This paper discusses in detail the behavior of the basic k-means algorithm with four more new algorithms with varied distance measures on gene expression data. In data mining, k-means clustering is a method which aims to partition n observations into k clusters in which each observation belongs to the cluster with the nearest mean. The traditional k-means is one of the most popular clustering methods for analyzing gene expression data. However, it suffers from major shortcomings. It is sensitive to initial partitions and it is only applicable to data with spherical-shape clusters. The results of the present study show that the performances of the new algorithms are extremely well when compared to the traditional k-means and also emphasizes that through the regulation of distance metric parameters, one can achieve better clustering effects then the traditional k-means, and has an advantage in sensitivity, specificity and run time. Finally it is found that Canberra k-means performs extremely well.
Keywords :
Algorithm design and analysis; Clustering algorithms; Databases; Genetics; Kernel; Sensitivity; Vectors; Canberra; Chebyshev; Data Mining; Gene Expression Data; Kernel; Manhattan; Supremum; k-Means;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Control (ISCO), 2013 7th International Conference on
Conference_Location :
Coimbatore, Tamil Nadu, India
Print_ISBN :
978-1-4673-4359-6
Type :
conf
DOI :
10.1109/ISCO.2013.6481187
Filename :
6481187
Link To Document :
بازگشت