• DocumentCode
    2531304
  • Title

    A New Metric to Measure Gene Product Similarity

  • Author

    Mathur, Sachin ; Dinakarpandian, Deendayal

  • Author_Institution
    Univ. of Missouri-Kansas, Kansas City
  • fYear
    2007
  • fDate
    2-4 Nov. 2007
  • Firstpage
    333
  • Lastpage
    338
  • Abstract
    The widespread use of microarray technology and sequencing of genomes has made it increasingly possible to study the cellular sub-systems of organisms. Computational techniques applied to sequence data annotated with ontologies such as the gene ontology (GO) aid in understanding regulatory networks of genes. An important related problem is the estimation of the similarity between gene products based on their annotations. We present an approach to compute gene product similarity that takes into account both the hierarchical nature of GO and the co-occurrence of GO terms in annotations. It also accounts for differences in the cardinality of annotations and differences in the frequency of usage of GO terms. We demonstrate the validity of the metric by computing the similarity between gene products in several different contexts. These include the analysis of similarity within a specific signaling pathway, between proteins constituting a sequence family and the comparative evaluation of different clusterings of microarray data.
  • Keywords
    biology computing; cellular biophysics; genetics; molecular biophysics; molecular configurations; ontologies (artificial intelligence); proteins; cellular sub-systems; data clustering; gene annotations; gene ontology; gene product similarity; genome sequencing; microarray technology; proteins; signaling pathway; Bioinformatics; Biomedical computing; Biomedical engineering; Biomedical measurements; Gene expression; Genomics; Ontologies; Organisms; Proteins; Signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
  • Conference_Location
    Fremont, CA
  • Print_ISBN
    978-0-7695-3031-4
  • Type

    conf

  • DOI
    10.1109/BIBM.2007.62
  • Filename
    4413074