• DocumentCode
    2771341
  • Title

    Determining Molecular Similarity for Drug Discovery using the Wavelet Riemannian Metric

  • Author

    Velasquez, Elinor ; Yera, Emmanuel R. ; Singh, Rahul

  • Author_Institution
    Dept. of Comput. Sci., San Francisco State Univ., CA
  • fYear
    2006
  • fDate
    16-18 Oct. 2006
  • Firstpage
    261
  • Lastpage
    268
  • Abstract
    Discerning the similarity between two molecules is a challenging problem in drug discovery as well as in molecular biology. The importance of this problem is due to the fact that the biochemical characteristics of a molecule are closely related to its structure. Therefore molecular similarity is a key notion in investigations targeted at understanding existing molecules as well as in guiding the synthesis of new molecules. Additionally, the notion of molecular similarity plays a central role in structure query-retrieval. This paper presents a wavelet-based Riemannian metric for determining molecular similarity. The proposed metric extends traditional molecular similarity measures in terms of its ability to capture and compare nonlinear molecular descriptors, thus allowing more accurate characterization of the true nature of the factors involved. Furthermore, owing to its metric properties and wavelet nature, this similarity measure supports highly efficient query-retrieval strategies. To compare graph-based molecular representations using the wavelet-based Riemannian metric, the paper uses a two-phase molecular graph matching strategy. In the first step, an efficient nonlinear graph-matching technique based on the graduated assignment algorithm is used to obtain a preliminary correspondence between molecular graphs in terms of their topological characteristics. Starting from this correspondence, the second stage directly optimizes the proposed metric on arbitrary molecular descriptors using a branch-and-bound search strategy. Various experiments, many in comparative settings, study the retrieval performance of this similarity formulation and underline its efficacy and efficiency
  • Keywords
    biochemistry; biology computing; drugs; graph theory; molecular biophysics; pattern matching; query processing; tree searching; biochemical characteristics; branch-and-bound search strategy; drug discovery; graduated assignment algorithm; graph-based molecular representations; molecular biology; molecular similarity determination; molecular topological characteristics; nonlinear graph-matching technique; nonlinear molecular descriptors; retrieval performance; structure query-retrieval strategies; two-phase molecular graph matching strategy; wavelet Riemannian metric; Biology computing; Chemistry; Databases; Drugs; Explosives; Pharmaceuticals; Pipelines; Space exploration; Testing; Throughput;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
  • Conference_Location
    Arlington, VA
  • Print_ISBN
    0-7695-2727-2
  • Type

    conf

  • DOI
    10.1109/BIBE.2006.253343
  • Filename
    4019668