Title :
Identifying disease genes from gene expression data based on singular value decomposition
Author :
Zhang, Huanping ; Song, Xiaofeng ; Zhang, Xiaobai
Author_Institution :
Coll. of Mech. & Electron. Eng., Nanjing Forestry Univ., Nanjing, China
Abstract :
Identification of disease genes that might anticipate the clinical behavior of human cancers is very important for understanding cancer pathogenesis. Computational analysis of disease gene from microarray data involves a search for gene subset that is able to discriminate cancer samples from normal samples, which is a challenging task due to a small number of samples compared to huge number of genes. In this paper, an algorithm (LRSVD) based on singular value decomposition and logistic regression is proposed to find genes that are associated with disease. LRSVD makes use of a threshold value to control the number of singular vectors; evaluates the contribution of each eigengene to the classifying accuracy by regression coefficients of logistic regression; and then ranks each gene by its discriminative power for two kinds of samples. The results on colon gene expression data indicate that LRSVD method with support vector machine (SVM) as a classifier is an encouraging method to identify disease genes.
Keywords :
diseases; genetics; medical computing; molecular biophysics; singular value decomposition; support vector machines; disease gene; eigengene; gene expression data; logistic regression; regression coefficient; singular value decomposition; singular vector; support vector machine; Accuracy; Cancer; Diseases; Gene expression; Humans; Logistics; Vectors; logistic regression; microarray data; singular value decomposition;
Conference_Titel :
Biomedical Engineering and Informatics (BMEI), 2011 4th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9351-7
DOI :
10.1109/BMEI.2011.6098516