DocumentCode :
2116417
Title :
Application of an algorithm for the computation of functional similarity in the research of protein interaction
Author :
Hongxia, Chen
Author_Institution :
Coll. of Basic Med., Xianning Univ., Xianning, China
fYear :
2012
fDate :
21-23 April 2012
Firstpage :
172
Lastpage :
174
Abstract :
The similarity among gene products based on gene ontology terminology is mainly measured through semantic similarity, which first computes the semantic similarity among gene ontology terminologies and then computes the similarity among gene products related to GO terminologies. Methods for computing the semantic similarity in the first step are either information-quantity-based or semantic-distance-based. The former category considers the information quantity of the nodes themselves while neglecting the structure of gene ontology; the later category neglects the semantic relations among the nodes in gene ontology and the uneven distribution of connections between nodes. This paper proposes a comprehensive approach where the similarity is first computed on the terminological level, inducing density-constraining and depth-constraining variables based on the microscopic structure (density and depth of the nodes) of the gene ontology nodes, then based on node properties and node-based instances, the comprehensive semantic similarity is computed. Published researches regarding the functional similarity in the second step are rare. Considering the defect of computing the functional similarity by averaging, a new weighted functional similarity algorithm is proposed. The effects of 4 algorithms: average-based, maximum-based, Wang and WSim are evaluated by means of ROC curve using the interaction data of human protein and fruit fly protein from the DIP database. Result of comparison proves that the proposed weighted algorithm is favorable.
Keywords :
biology computing; genetics; molecular biophysics; ontologies (artificial intelligence); proteins; DIP database; GO terminology; ROC curve; average-based algorithms; density-constraining variables; depth-constraining variables; fruit fly protein; functional similarity computation; gene ontology nodes; gene ontology terminology; gene products; human protein; maximum-based algorithms; microscopic structure; protein interaction; semantic similarity; weighted functional similarity algorithm; Functional Similarity; GO Terminology; Protein Interaction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Electronics, Communications and Networks (CECNet), 2012 2nd International Conference on
Conference_Location :
Yichang
Print_ISBN :
978-1-4577-1414-6
Type :
conf
DOI :
10.1109/CECNet.2012.6201609
Filename :
6201609
Link To Document :
بازگشت