DocumentCode
582820
Title
Integrating multiple gene semantic similarity profiles to infer disease genes
Author
Peng, He ; Rui, Jiang
Author_Institution
Dept. of Autom., Tsinghua Univ., Beijing, China
fYear
2012
fDate
25-27 July 2012
Firstpage
7420
Lastpage
4725
Abstract
The inference of genes that are associated with human inherited diseases (disease genes) has been a task of great challenging in biological and medical studies. Many computational methods have been proposed to prioritize candidate genes with the use of a variety of genomic information. In this work, we propose a novel perspective of binary classification for the inference of disease genes. We integrate three semantic similarity profiles of human genes, a phenotype similarity profile of human diseases, and known associations between diseases and genes to obtain three numerical features that indicate the relevance between a given disease-gene pair. With the features, we use three classification methods (the logistic regression, the random forest, and the support vector machine) to predict whether a gene is truly associated with a disease or not. We apply 10-fold cross-validation experiments to assess the performance of the proposed method and show the effectiveness of this approach. We further show that this binary classification formulation can also be used to address the problem of prioritizing candidate genes.
Keywords
diseases; genetics; medical computing; pattern classification; regression analysis; support vector machines; binary classification; biological studies; classification methods; disease gene inference; disease genes; disease-gene pair; genomic information; human inherited diseases; logistic regression; medical studies; multiple gene semantic similarity profiles; phenotype similarity profile; random forest; support vector machine; Diseases; Feature extraction; Humans; Logistics; Machine learning; Semantics; Support vector machines; Disease genes; gene semantic similarity; phenotype similarity; prediction; prioritization;
fLanguage
English
Publisher
ieee
Conference_Titel
Control Conference (CCC), 2012 31st Chinese
Conference_Location
Hefei
ISSN
1934-1768
Print_ISBN
978-1-4673-2581-3
Type
conf
Filename
6391254
Link To Document