DocumentCode :
2251388
Title :
Taxonomy-based soft similarity measures in bioinformatics
Author :
Keller, James M. ; Popescu, Mihail ; Mitchell, Joyce
Author_Institution :
Dept. of Electr. & Comput. Eng., Missouri Univ., Columbia, MO, USA
Volume :
1
fYear :
2004
fDate :
25-29 July 2004
Firstpage :
23
Abstract :
One of the most important objects in bioinformatics is a gene product (a protein or an RNA). Besides the gene sequence and expression values found following a microarray experiment, for many gene products, additional functional information comes from the set of gene ontology (GO) annotations and the set of journal abstracts related to the gene product. For these genes, it is reasonable to include similarity measures based on the terms found in the GO and/or the index term sets of the related documents (MeSH annotations). We propose a fuzzy measure-based similarity (FMS) for computing the similarity of two gene products annotated with terms from ontology. The advantage of FMS is that it takes into consideration the context of the whole set when computing the similarity. For the case when the two gene products are not annotated by common ontology terms, we propose a method that avoids a zero similarity result. In dealing with large groups of documents describing the objects under consideration, not only do we determine the similarity between the document pairs, but, by introducing the Choquet integral to the scenario, we can fuse this partial agreement function on pairs of documents into a single value relating the gene products. We present examples of FMS calculation for specific situations where two genes are described by a set of terms from the gene ontology, comparing our measures to others from the literature.
Keywords :
biology computing; fuzzy set theory; proteins; Choquet integral; RNA; bioinformatics; gene ontology annotations; gene product; protein; taxonomy; Abstracts; Bioinformatics; Biomedical informatics; DNA; Engineering management; Flexible manufacturing systems; Fuses; Ontologies; Protein engineering; RNA;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems, 2004. Proceedings. 2004 IEEE International Conference on
ISSN :
1098-7584
Print_ISBN :
0-7803-8353-2
Type :
conf
DOI :
10.1109/FUZZY.2004.1375679
Filename :
1375679
Link To Document :
بازگشت