مرکز منطقه ای اطلاع رساني علوم و فناوري - Application of an improved K-means algorithm in gene expression data analysis

DocumentCode :

3491059

Title :

Application of an improved K-means algorithm in gene expression data analysis

Author :

Ren, Qian ; Zhuo, Xinjian

Author_Institution :

Sch. of Sci., Beijing Univ. of Posts & Commun., Beijing, China

fYear :

2011

fDate :

2-4 Sept. 2011

Firstpage :

Lastpage :

Abstract :

K-means algorithm is one of the most classic partition algorithms in clustering algorithms. The result obtained by K-means algorithm varies with the choice of the initial clustering centers. Motivated by this, an improved K-means algorithm is proposed based on the Kruskal algorithm, which is famous in graph theory. The procedure of this algorithm is shown as follows: Firstly, the minimum spanning tree (MST) of the clustered objects is obtained by using Kruskal algorithm. Then K-1 edges are deleted based on weights in a descending order. At last, the average values of the objects contained by the k-connected graphs resulting from last two steps are regarded as the initial clustering centers to cluster. Make the improved K-means algorithm used in gene expression data analysis, simulation experiment shows that the improved K-means algorithm has a better clustering effect and higher efficiency than the traditional one.

Keywords :

biology computing; genetics; molecular biophysics; K-means algorithm; Kruskal algorithm; gene expression data analysis; graph theory; initial clustering centers; minimum spanning tree; Algorithm design and analysis; Clustering algorithms; Data analysis; Educational institutions; Gene expression; Partitioning algorithms; Systems biology; Clustering; Gene expression data; K-means Algorithm; Kruskal Algorithm; MST;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Systems Biology (ISB), 2011 IEEE International Conference on

Conference_Location :

Zhuhai

Print_ISBN :

978-1-4577-1661-4

Electronic_ISBN :

978-1-4577-1665-2

Type :

conf

DOI :

10.1109/ISB.2011.6033126

Filename :

6033126

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3491059