• DocumentCode
    1785205
  • Title

    Improving annotation quality in gene ontology by mining cross-ontology weighted association rules

  • Author

    Agapito, Giuseppe ; Milano, Michela ; Guzzi, Pietro H. ; Cannataro, Mario

  • Author_Institution
    Dept. of Surg. & Med. Sci., Univ. of Catanzaro, Catanzaro, Italy
  • fYear
    2014
  • fDate
    2-5 Nov. 2014
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    The Gene Ontology (GO) is the major resource of annotations for genes and proteins. Despite the presence of large efforts to avoid errors and inconsistencies, some unreliabilities are still present. In particular electronically inferred annotations are more unreliable than manual ones and their number is growing. Thus, the need for an accurate evaluation of annotations in an automatic way arises. In the past, some approaches for improving annotation consistencies have been proposed using association rule mining to discover hidden relationships among GO terms. However such approaches consider all the GO terms equally, while GO terms have different Information Content, i.e. different relevance. Consequently we designed a novel algorithm, (GO-WAR), Mining Weighted Association Rules from GO, that is based on the extraction of weighted association rules considering the IC of terms. We evaluated our algorithm considering seven different species and all the GO ontologies. In all the experiments GO-WAR outperformed state of the art approaches.
  • Keywords
    bioinformatics; data mining; feature extraction; genetics; molecular biophysics; ontologies (artificial intelligence); proteins; GO-WAR algorithm; annotation quality; cross-ontology weighted association rule mining; electronically inferred annotations; gene ontology; information content; proteins; weighted association rule extraction; Association rules; Biological information theory; Integrated circuits; Ontologies; Proteins; Association Rules; Gene Ontology; Weighted Rules;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2014 IEEE International Conference on
  • Conference_Location
    Belfast
  • Type

    conf

  • DOI
    10.1109/BIBM.2014.6999374
  • Filename
    6999374