Title :
Towards context-aware similarity metrics
Author :
Morent, Dominik ; Patterson, David E. ; Berthold, Michael R.
Author_Institution :
Dept. of Comput. & Inf. Sci., Konstanz Univ., Germany
Abstract :
Molecular structures are often compared using relatively high-dimensional binary fingerprints that represent the absence or presence of certain structural elements. The Tanimoto distance commonly used for this purpose considers all those bits similar, although some of the underlying structural elements may be quite closely related. We propose a weighting scheme for such related bits that softens the influence of differences in bit patterns representing similar contexts. We demonstrate the usefulness of this context-aware similarity metric using an automated bit-weighting mechanism and a set of similarity weights based on Topomer shape similarity between the structural elements encoded in the fingerprint.
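The abstract does not spell out the weighting scheme itself, so the following is only a minimal sketch of the general idea, not the authors' method. It shows the standard Tanimoto distance on binary fingerprints next to one possible "soft" variant in which a bit-to-bit similarity matrix lets related bits partially match. The function names, the matrix `s`, and the quadratic-form generalization are all assumptions made for illustration; in the paper the weights would come from Topomer shape similarity, which is not modeled here.

```python
def tanimoto_distance(a, b):
    """Standard Tanimoto distance between two equal-length 0/1 fingerprints."""
    both = sum(1 for x, y in zip(a, b) if x and y)    # bits set in both
    either = sum(1 for x, y in zip(a, b) if x or y)   # bits set in at least one
    if either == 0:
        return 0.0  # convention chosen here: two empty fingerprints are identical
    return 1.0 - both / either


def _quad(a, s, b):
    """Bilinear form a^T S b for plain Python lists."""
    return sum(a[i] * s[i][j] * b[j]
               for i in range(len(a)) for j in range(len(b)))


def soft_tanimoto_distance(a, b, s):
    """Hypothetical context-aware variant (an assumption, not the paper's formula).

    s[i][j] in [0, 1] says how closely bit i and bit j are related, with
    s[i][i] == 1. With s equal to the identity matrix this reduces to the
    standard Tanimoto distance. The result is only guaranteed to stay in
    [0, 1] when s is symmetric positive semidefinite.
    """
    ab = _quad(a, s, b)
    denom = _quad(a, s, a) + _quad(b, s, b) - ab
    if denom == 0:
        return 0.0
    return 1.0 - ab / denom


# Two fingerprints that differ only in two bits encoding related elements.
fp_a = [1, 0, 0]
fp_b = [0, 1, 0]

identity = [[1.0, 0.0, 0.0],
            [0.0, 1.0, 0.0],
            [0.0, 0.0, 1.0]]

# Illustrative weights only: bits 0 and 1 are treated as 80% similar.
related = [[1.0, 0.8, 0.0],
           [0.8, 1.0, 0.0],
           [0.0, 0.0, 1.0]]

hard = tanimoto_distance(fp_a, fp_b)
soft = soft_tanimoto_distance(fp_a, fp_b, related)
```

With these made-up weights the plain Tanimoto distance is 1.0 (no shared bits), while the soft variant drops to roughly 0.33 because the two differing bits are declared closely related. This illustrates how a difference in bit patterns can be softened when the bits represent similar contexts.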
Keywords :
biology computing; feature extraction; molecular biophysics; molecular configurations; Tanimoto distance; Topomer shape similarity; binary fingerprints; bit patterns; bit-weighting mechanism; context-aware similarity metrics; molecular structures; Bioinformatics; Cancer; Chemicals; Context awareness; Euclidean distance; Fingerprint recognition; Fingers; Fluctuations; Particle measurements; Shape;
Conference_Title :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527933