• DocumentCode
    464293
  • Title

    Semantic Analysis of Genome Annotations using Weighting Schemes

  • Author

    Done, Bogdan ; Khatri, Purvesh ; Done, Arina ; Draghici, Sorin

  • Author_Institution
    Dept. of Comput. Sci., Wayne State Univ., Detroit, MI
  • fYear
    2007
  • fDate
    1-5 April 2007
  • Firstpage
    212
  • Lastpage
    218
  • Abstract
    The correct interpretation of many molecular biology experiments depends in an essential way on the accuracy and consistency of the existing annotation databases. Such databases are meant to act as repositories for our biological knowledge as we acquire and refine it. Hence, by definition they are incomplete at any given time. In this paper we describe a technique that improves our previous method for extracting implicit semantic relationships between genes and functions. We added a number of weighting schemes to our previous latent semantic indexing approach. We used this technique to analyze the current annotations of the human genome. The predictions of 15 different weighting schemes were compared and evaluated. Out of the top 50 functional annotations predicted using the best performing weighting scheme, we found support in the literature for 82% of them. For 10% of our prediction we did not find any relevant publications, and 6% were actually contradicted by existing literature. This weighting scheme also outperformed the simple binary scheme used in our previous approach. Our method is independent of the organism and can be used to analyze and improve the quality of the data of any public or private annotation database
  • Keywords
    biology computing; genetics; information analysis; annotation databases; biological knowledge; genome annotations; implicit semantic relationships; molecular biology; semantic analysis; weighting schemes; Bioinformatics; Biological processes; Computational biology; Computational intelligence; Databases; Genomics; Humans; Indexing; Ontologies; Organisms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Bioinformatics and Computational Biology, 2007. CIBCB '07. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0710-9
  • Type

    conf

  • DOI
    10.1109/CIBCB.2007.4221226
  • Filename
    4221226