DocumentCode :
3317033
Title :
A Center Closeness Algorithm for the Analyses of Gene Expression Data
Author :
Wang, Huakun ; Feng, Lixin ; Ying, Zhou ; Xu, Zhang ; Wang, Zhenzhen
Author_Institution :
Sch. of Math. Sci., Heilongjiang Univ., Harbin, China
fYear :
2011
fDate :
10-12 May 2011
Firstpage :
1
Lastpage :
2
Abstract :
Clustering is an important computational tool for identifying sets of genes with similar profiles. Various clustering methods have been proposed for the analysis of gene expression data. Among them, k-means is widely used because of its simplicity and computational speed, which allow it to run on large datasets. However, k-means requires the number of clusters to be specified before clustering, and this choice greatly influences the clustering results. This paper proposes a novel center closeness clustering algorithm that automatically determines the number of clusters based on the distances between data points. We used the proposed algorithm to cluster two gene expression datasets and compared the results with those obtained by k-means. The cluster validity indices showed that our algorithm is clearly superior to k-means.
Keywords :
biology computing; data analysis; genetic algorithms; genetics; pattern clustering; very large databases; center closeness algorithm; center closeness clustering algorithm; cluster number; cluster validity indices; clustering methods; clustering results; computational speed; computational tool; data points; gene expression data; gene sets; k-means; large datasets; Algorithm design and analysis; Bioinformatics; Classification algorithms; Clustering algorithms; Gene expression; Indexes; Tumors;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Bioinformatics and Biomedical Engineering, (iCBBE) 2011 5th International Conference on
Conference_Location :
Wuhan
ISSN :
2151-7614
Print_ISBN :
978-1-4244-5088-6
Type :
conf
DOI :
10.1109/icbbe.2011.5779974
Filename :
5779974