Title :
Efficient retrieval of electron density patterns for modeling proteins by X-ray crystallography
Author :
Gopal, K. ; Romo, T.D. ; Sacchettini, J.C. ; Ioerger, T.R.
Abstract :
Inefficient case retrieval is a major problem in many case-based reasoning systems, especially when case matching is expensive and the case-base is large. In this paper, we present a two-phase approach where an inexpensive feature-based method is used to find a set of potential matches and a more expensive and accurate case matching method is used to make the final selection. This approach has been successfully employed in TEXTAL™, a system that retrieves previously solved 3D patterns of electron density from a database to determine the structure of proteins. Electron density patterns are characterized by numeric features, and an appropriate distance measure is used to efficiently filter good matches through an exhaustive search of the database. These matches are then examined using a computationally expensive density correlation procedure based on finding an optimal superposition between 3D patterns. We provide an empirical and theoretical analysis of some of the key issues related to this method. In particular, we define a model for estimating how approximate various feature-based similarity measures are (relative to an objective matching metric), and determine its relation to the number of cases that should be filtered from a given database to make the approach effective.
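The two-phase retrieval described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the TEXTAL implementation: the function names, the dict-based case layout, the weighted Euclidean feature distance, and the toy data are all hypothetical. In particular, `density_correlation` is a plain Pearson correlation over flattened grids, used here only as a stand-in for the expensive matcher; it performs no search for an optimal superposition.

```python
import heapq
import math


def feature_distance(a, b, weights=None):
    """Weighted Euclidean distance between two feature vectors (assumed metric)."""
    if weights is None:
        weights = [1.0] * len(a)
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def density_correlation(p, q):
    """Pearson correlation between two flattened density grids.

    Stand-in for the expensive matcher: no superposition search is done here.
    """
    n = len(p)
    mean_p = sum(p) / n
    mean_q = sum(q) / n
    cov = sum((x - mean_p) * (y - mean_q) for x, y in zip(p, q))
    var_p = sum((x - mean_p) ** 2 for x in p)
    var_q = sum((y - mean_q) ** 2 for y in q)
    if var_p == 0 or var_q == 0:
        return 0.0
    return cov / math.sqrt(var_p * var_q)


def two_phase_retrieve(query, cases, k, weights=None):
    """Return the best case for `query`; each case has 'features' and 'density'."""
    # Phase 1: cheap exhaustive scan, keep the k nearest cases by feature distance.
    candidates = heapq.nsmallest(
        k,
        cases,
        key=lambda c: feature_distance(query["features"], c["features"], weights),
    )
    # Phase 2: expensive comparison, applied to the k candidates only.
    return max(
        candidates,
        key=lambda c: density_correlation(query["density"], c["density"]),
    )


# Toy data, invented for illustration only.
cases = [
    {"id": "a", "features": [0.0, 0.0], "density": [1.0, 2.0, 3.0, 4.0]},
    {"id": "b", "features": [0.2, 0.0], "density": [4.0, 3.0, 2.0, 1.0]},
    {"id": "c", "features": [5.0, 5.0], "density": [1.0, 2.0, 3.0, 4.0]},
]
query = {"features": [0.05, 0.0], "density": [2.0, 4.0, 6.0, 8.0]}
best = two_phase_retrieve(query, cases, k=2)
```

The choice of `k` is the trade-off the abstract points to: a small `k` keeps the expensive phase cheap, but if the feature distance only approximates the true match quality, the best case may be filtered out before phase 2 ever sees it.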
Keywords :
Computer science; Crystallography; Databases; Electrons; Information retrieval; Matched filters; Particle measurements; Pattern matching; Predictive models; Proteins;
Conference_Title :
Machine Learning and Applications, 2004. Proceedings. 2004 International Conference on
Conference_Location :
Louisville, Kentucky, USA
Print_ISBN :
0-7803-8823-2
DOI :
10.1109/ICMLA.2004.1383539