DocumentCode :
582820
Title :
Integrating multiple gene semantic similarity profiles to infer disease genes
Author :
Peng, He ; Rui, Jiang
Author_Institution :
Dept. of Autom., Tsinghua Univ., Beijing, China
fYear :
2012
fDate :
25-27 July 2012
Firstpage :
7420
Lastpage :
4725
Abstract :
The inference of genes that are associated with human inherited diseases (disease genes) has been a task of great challenging in biological and medical studies. Many computational methods have been proposed to prioritize candidate genes with the use of a variety of genomic information. In this work, we propose a novel perspective of binary classification for the inference of disease genes. We integrate three semantic similarity profiles of human genes, a phenotype similarity profile of human diseases, and known associations between diseases and genes to obtain three numerical features that indicate the relevance between a given disease-gene pair. With the features, we use three classification methods (the logistic regression, the random forest, and the support vector machine) to predict whether a gene is truly associated with a disease or not. We apply 10-fold cross-validation experiments to assess the performance of the proposed method and show the effectiveness of this approach. We further show that this binary classification formulation can also be used to address the problem of prioritizing candidate genes.
Keywords :
diseases; genetics; medical computing; pattern classification; regression analysis; support vector machines; binary classification; biological studies; classification methods; disease gene inference; disease genes; disease-gene pair; genomic information; human inherited diseases; logistic regression; medical studies; multiple gene semantic similarity profiles; phenotype similarity profile; random forest; support vector machine; Diseases; Feature extraction; Humans; Logistics; Machine learning; Semantics; Support vector machines; Disease genes; gene semantic similarity; phenotype similarity; prediction; prioritization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Control Conference (CCC), 2012 31st Chinese
Conference_Location :
Hefei
ISSN :
1934-1768
Print_ISBN :
978-1-4673-2581-3
Type :
conf
Filename :
6391254
Link To Document :
بازگشت