DocumentCode :
3491059
Title :
Application of an improved K-means algorithm in gene expression data analysis
Author :
Ren, Qian ; Zhuo, Xinjian
Author_Institution :
Sch. of Sci., Beijing Univ. of Posts & Commun., Beijing, China
fYear :
2011
fDate :
2-4 Sept. 2011
Firstpage :
87
Lastpage :
91
Abstract :
K-means algorithm is one of the most classic partition algorithms in clustering algorithms. The result obtained by K-means algorithm varies with the choice of the initial clustering centers. Motivated by this, an improved K-means algorithm is proposed based on the Kruskal algorithm, which is famous in graph theory. The procedure of this algorithm is shown as follows: Firstly, the minimum spanning tree (MST) of the clustered objects is obtained by using Kruskal algorithm. Then K-1 edges are deleted based on weights in a descending order. At last, the average values of the objects contained by the k-connected graphs resulting from last two steps are regarded as the initial clustering centers to cluster. Make the improved K-means algorithm used in gene expression data analysis, simulation experiment shows that the improved K-means algorithm has a better clustering effect and higher efficiency than the traditional one.
Keywords :
biology computing; genetics; molecular biophysics; K-means algorithm; Kruskal algorithm; gene expression data analysis; graph theory; initial clustering centers; minimum spanning tree; Algorithm design and analysis; Clustering algorithms; Data analysis; Educational institutions; Gene expression; Partitioning algorithms; Systems biology; Clustering; Gene expression data; K-means Algorithm; Kruskal Algorithm; MST;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems Biology (ISB), 2011 IEEE International Conference on
Conference_Location :
Zhuhai
Print_ISBN :
978-1-4577-1661-4
Electronic_ISBN :
978-1-4577-1665-2
Type :
conf
DOI :
10.1109/ISB.2011.6033126
Filename :
6033126
Link To Document :
بازگشت