• DocumentCode
    2414331
  • Title

    A gene ranking method using text-mining for the identification of disease related genes

  • Author

    Lee, Hyungmin ; Shin, Miyoung ; Hong, Munpyo

  • Author_Institution
    Grad. Sch. of EECS, Kyungpook Nat. Univ., Daegu, South Korea
  • fYear
    2010
  • fDate
    18-21 Dec. 2010
  • Firstpage
    493
  • Lastpage
    498
  • Abstract
    For the identification of significant genes involved in specific diseases, microarray gene expression profiles have been widely used to prioritize candidate genes. In this paper, we propose a new gene ranking method that employs genegene relations extracted from literature along with gene expression scores obtained from microarrays. Here the genegene relations are extracted by taking a hybrid approach which is a combination of syntactic analysis and co-occurrence based approaches. Specifically, we perform the syntactic parsing on the text and then, within each clause of the parsed sentence, the co-occurred gene names are considered to be mutually related. Both the gene network derived from the gene-gene relations obtained in the above way and the gene expression scores are given as the inputs to the GeneRank algorithm. For the evaluation of our approach, we conducted experiments with the publicly available prostate cancer data. The results show that our method is superior in the precision and the recall to the original GeneRank which employs the gene-gene relations built from gene ontology annotations. Furthermore, our hybrid approach to the gene-gene relation extraction produces better prioritization of truly disease-related genes in top ranks than the existing popular co-occurrence approach.
  • Keywords
    bioinformatics; cancer; data mining; genetics; GeneRank algorithm; cooccurrence based approach; disease related gene identification; gene network; gene ranking method; gene-gene relation; microarray gene expression profile; prostate cancer; text mining; Data mining; Dictionaries; Diseases; Gene expression; Prostate cancer; Syntactics; disease related genes; gene ranking; microarray data analysis; relation extraction; text-mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-8306-8
  • Electronic_ISBN
    978-1-4244-8307-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706616
  • Filename
    5706616