• DocumentCode
    582820
  • Title

    Integrating multiple gene semantic similarity profiles to infer disease genes

  • Author

    Peng, He ; Rui, Jiang

  • Author_Institution
    Dept. of Autom., Tsinghua Univ., Beijing, China
  • fYear
    2012
  • fDate
    25-27 July 2012
  • Firstpage
    7420
  • Lastpage
    7425
  • Abstract
    The inference of genes associated with human inherited diseases (disease genes) has been a highly challenging task in biological and medical studies. Many computational methods have been proposed to prioritize candidate genes using a variety of genomic information. In this work, we propose a novel binary classification perspective on the inference of disease genes. We integrate three semantic similarity profiles of human genes, a phenotype similarity profile of human diseases, and known associations between diseases and genes to obtain three numerical features that indicate the relevance of a given disease-gene pair. With these features, we use three classification methods (logistic regression, random forest, and support vector machine) to predict whether a gene is truly associated with a disease. We use 10-fold cross-validation experiments to assess the performance of the proposed method and show the effectiveness of the approach. We further show that this binary classification formulation can also be used to address the problem of prioritizing candidate genes.
  • Keywords
    diseases; genetics; medical computing; pattern classification; regression analysis; support vector machines; binary classification; biological studies; classification methods; disease gene inference; disease genes; disease-gene pair; genomic information; human inherited diseases; logistic regression; medical studies; multiple gene semantic similarity profiles; phenotype similarity profile; random forest; support vector machine; Diseases; Feature extraction; Humans; Logistics; Machine learning; Semantics; Support vector machines; Disease genes; gene semantic similarity; phenotype similarity; prediction; prioritization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Control Conference (CCC), 2012 31st Chinese
  • Conference_Location
    Hefei
  • ISSN
    1934-1768
  • Print_ISBN
    978-1-4673-2581-3
  • Type

    conf

  • Filename
    6391254