Title :
Gene function prediction using protein domain probability and hierarchical Gene Ontology information
Author :
Jung, Jaehee ; Thon, Michael R.
Author_Institution :
Dept. of Comput. Sci., Texas A&M Univ., College Station, TX
Abstract :
The Gene Ontology (GO) is a controlled vocabulary of terms to describe protein functions. It also includes a hierarchical description of the relationships among the terms in the form of a directed acyclic graph (DAG). Several systems have been developed that employ pattern recognition to assign gene function, using a variety of features, including sequence similarity, presence of protein functional domains and gene expression patterns, but most of these approaches have not considered the hierarchical structure of the GO. The DAG represents the functional relationships between the GO terms, thus it should be an important component of an automated annotation system. We propose a Bayesian, multi-label classifier that incorporates the relationships among GO terms found in the GO DAG. A comparative analysis of our method to other previously described annotation systems shows that our method provides improved annotation accuracy when the performance of individual GO terms are compared. More importantly, our method enables the classification of significantly more GO terms to more proteins than were previously possible.
Keywords :
Bayes methods; biology computing; directed graphs; genetics; ontologies (artificial intelligence); pattern classification; proteins; Bayesian classifier; automated annotation system; directed acyclic graph; gene expression pattern; gene function prediction; hierarchical gene ontology information; multi-label classifier; pattern recognition; protein domain probability; protein functional domains; protein functions; sequence similarity; Automatic control; Bayesian methods; Bioinformatics; Gene expression; Genomics; Ontologies; Pediatrics; Probability; Protein engineering; Vocabulary;
Conference_Titel :
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location :
Tampa, FL
Print_ISBN :
978-1-4244-2174-9
Electronic_ISBN :
1051-4651
DOI :
10.1109/ICPR.2008.4761737