DocumentCode
2771341
Title
Determining Molecular Similarity for Drug Discovery using the Wavelet Riemannian Metric
Author
Velasquez, Elinor ; Yera, Emmanuel R. ; Singh, Rahul
Author_Institution
Dept. of Comput. Sci., San Francisco State Univ., CA
fYear
2006
fDate
16-18 Oct. 2006
Firstpage
261
Lastpage
268
Abstract
Discerning the similarity between two molecules is a challenging problem in drug discovery as well as in molecular biology. The importance of this problem is due to the fact that the biochemical characteristics of a molecule are closely related to its structure. Therefore molecular similarity is a key notion in investigations targeted at understanding existing molecules as well as in guiding the synthesis of new molecules. Additionally, the notion of molecular similarity plays a central role in structure query-retrieval. This paper presents a wavelet-based Riemannian metric for determining molecular similarity. The proposed metric extends traditional molecular similarity measures in terms of its ability to capture and compare nonlinear molecular descriptors, thus allowing more accurate characterization of the true nature of the factors involved. Furthermore, owing to its metric properties and wavelet nature, this similarity measure supports highly efficient query-retrieval strategies. To compare graph-based molecular representations using the wavelet-based Riemannian metric, the paper uses a two-phase molecular graph matching strategy. In the first step, an efficient nonlinear graph-matching technique based on the graduated assignment algorithm is used to obtain a preliminary correspondence between molecular graphs in terms of their topological characteristics. Starting from this correspondence, the second stage directly optimizes the proposed metric on arbitrary molecular descriptors using a branch-and-bound search strategy. Various experiments, many in comparative settings, study the retrieval performance of this similarity formulation and underline its efficacy and efficiency
Keywords
biochemistry; biology computing; drugs; graph theory; molecular biophysics; pattern matching; query processing; tree searching; biochemical characteristics; branch-and-bound search strategy; drug discovery; graduated assignment algorithm; graph-based molecular representations; molecular biology; molecular similarity determination; molecular topological characteristics; nonlinear graph-matching technique; nonlinear molecular descriptors; retrieval performance; structure query-retrieval strategies; two-phase molecular graph matching strategy; wavelet Riemannian metric; Biology computing; Chemistry; Databases; Drugs; Explosives; Pharmaceuticals; Pipelines; Space exploration; Testing; Throughput;
fLanguage
English
Publisher
ieee
Conference_Titel
BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
Conference_Location
Arlington, VA
Print_ISBN
0-7695-2727-2
Type
conf
DOI
10.1109/BIBE.2006.253343
Filename
4019668
Link To Document