DocumentCode
22568
Title
Measure the Semantic Similarity of GO Terms Using Aggregate Information Content
Author
Xuebo Song ; Lin Li ; Srimani, Pradip K. ; Yu, Philip S. ; Wang, James Z.
Author_Institution
Sch. of Comput., Clemson Univ., Clemson, SC, USA
Volume
11
Issue
3
fYear
2014
fDate
May-June 2014
Firstpage
468
Lastpage
476
Abstract
The rapid development of gene ontology (GO) and huge amount of biomedical data annotated by GO terms necessitate computation of semantic similarity of GO terms and, in turn, measurement of functional similarity of genes based on their annotations. In this paper we propose a novel and efficient method to measure the semantic similarity of GO terms. The proposed method addresses the limitations in existing GO term similarity measurement techniques; it computes the semantic content of a GO term by considering the information content of all of its ancestor terms in the graph. The aggregate information content (AIC) of all ancestor terms of a GO term implicitly reflects the GO term´s location in the GO graph and also represents how human beings use this GO term and all its ancestor terms to annotate genes. We show that semantic similarity of GO terms obtained by our method closely matches the human perception. Extensive experimental studies show that this novel method also outperforms all existing methods in terms of the correlation with gene expression data. We have developed web services for measuring semantic similarity of GO terms and functional similarity of genes using the proposed AIC method and other popular methods. These web services are available at http://bioinformatics.clemson.edu/G-SESAME.
Keywords
Web services; bioinformatics; correlation methods; data analysis; genetics; graph theory; medical information systems; ontologies (artificial intelligence); AIC method; GO graph; GO term location; GO term semantic content computation; GO term semantic similarity measurement; GO term similarity measurement techniques; Web service development; aggregate information content; ancestor term AIC; ancestor term information content; biomedical data annotation; gene annotations; gene expression data correlation; gene functional similarity measurement; gene ontology; semantic similarity computation; Bioinformatics; Biomedical measurement; Databases; Equations; Integrated circuits; Ontologies; Semantics; G-SESAME; GO similarity; Gene ontology; gene expression;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2013.176
Filename
6682909
Link To Document