DocumentCode :
1072173
Title :
Estimating the Number of Clusters via System Evolution for Cluster Analysis of Gene Expression Data
Author :
Wang, Kaijun ; Zheng, Jie ; Zhang, Junying ; Dong, Jiyang
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou, China
Volume :
13
Issue :
5
fYear :
2009
Firstpage :
848
Lastpage :
853
Abstract :
The estimation of the number of clusters (NC) is one of crucial problems in the cluster analysis of gene expression data. Most approaches available give their answers without the intuitive information about separable degrees between clusters. However, this information is useful for understanding cluster structures. To provide this information, we propose system evolution (SE) method to estimate NC based on partitioning around medoids (PAM) clustering algorithm. SE analyzes cluster structures of a dataset from the viewpoint of a pseudothermodynamics system. The system will go to its stable equilibrium state, at which the optimal NC is found, via its partitioning process and merging process. The experimental results on simulated and real gene expression data demonstrate that the SE works well on the data with well-separated clusters and the one with slightly overlapping clusters.
Keywords :
bioinformatics; genetics; pattern clustering; statistical analysis; gene expression data; partitioning around medoids clustering algorithm; pseudothermodynamics system; system evolution method; Cluster analysis; estimation of the number of clusters; partitioning around medoids; system evolution; Algorithms; Cluster Analysis; Computer Simulation; Databases, Genetic; Gene Expression; Models, Genetic; Models, Statistical;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2009.2025119
Filename :
5072285
Link To Document :
بازگشت