DocumentCode
3714439
Title
An integrative measure of graph- and vector-based semantic similarity using information content distance
Author
Qiaoli Hu; Young-Rae Cho
Author_Institution
Department of Computer Science, Baylor University, Waco, TX, USA
fYear
2015
Firstpage
517
Lastpage
522
Abstract
Gene Ontology (GO) and its annotation data have been widely used for genomic and proteomic analysis. In the past few years, various semantic similarity measures using GO have been proposed to quantify functional similarity between two proteins and assess validity of protein-protein interactions (PPIs). They are categorized as pairwise and groupwise approaches according to the strategies of deriving protein-to-protein functional similarity. We propose a novel semantic similarity measure, called simVICD, which is a graph-and vector-based groupwise approach. This method computes the magnitude of a common induced subgraph as semantic similarity between two sets of terms annotating two proteins, respectively. The magnitude of the common induced subgraph is represented as the Euclidean norm of a vector having information content distance of all possible directed shortest paths in the induced subgraph. Our experimental results show that the proposed groupwise approach, simVICD, and a previous integrative pairwise approach, simICND, outperform the other existing semantic similarity methods in predicting protein complexes and identifying essential proteins.
Keywords
"Chlorine","Proteins","Genomics","Bioinformatics","Ontologies"
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2015 IEEE International Conference on
Type
conf
DOI
10.1109/BIBM.2015.7359737
Filename
7359737
Link To Document