DocumentCode :
3001121
Title :
NMF based gene selection algorithm for improving performance of the spectral cancer clustering
Author :
Mirzal, Andri
Author_Institution :
Fac. of Comput., Univ. Teknol. Malaysia, Skudai, Malaysia
fYear :
2013
fDate :
Nov. 29 2013-Dec. 1 2013
Firstpage :
74
Lastpage :
78
Abstract :
Analyzing cancers using microarray gene expression datasets is currently an active research in medical community. There are many tasks related to this research, e.g., clustering and classifcation, data compression, and samples characterization. In this paper, we discuss the task of cancer clustering. The spectral clustering is one of the most commonly used methods in cancer clustering. As the gene expression datasets usually are highly imbalanced, i.e., containing only a few tissue samples (hundreds at most) but each is expressed by thousands of genes, filtering out some irrelevant and potentially misleading gene expressions is a necessary step to improve the performance of the method. In this paper, we propose an unsupervised gene selection algorithm based on the nonnegative matrix factorization (NMF). Our algorithm is designed by making use of the clustering capability of the NMF to select the most informative genes. Clustering performance of the spectral method is then evaluated by comparing the results using the original datasets with the results using the pruned datasets. Our results suggest that the proposed algorithm can be used to improve clustering performance of the spectral method.
Keywords :
cancer; genetics; matrix decomposition; medical computing; molecular biophysics; pattern clustering; NMF based gene selection algorithm; NMF clustering capability; cancer analysis; clustering performance; data classification; data clustering; data compression; medical community; microarray gene expression datasets; nonnegative matrix factorization; samples characterization; spectral cancer clustering; spectral method; Accuracy; Algorithm design and analysis; Bioinformatics; Cancer; Clustering algorithms; Gene expression; Matrix decomposition; cancer clustering; gene selection; nonnegative matrix factorization; spectral clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control System, Computing and Engineering (ICCSCE), 2013 IEEE International Conference on
Conference_Location :
Mindeb
Print_ISBN :
978-1-4799-1506-4
Type :
conf
DOI :
10.1109/ICCSCE.2013.6719935
Filename :
6719935
Link To Document :
بازگشت