Title :
A protein sequence database cross-field association system
Author :
Guigó, R. ; Vazquez, I. ; Rao, S. ; Smith, T.F.
Author_Institution :
LANL, Los Alamos, NM, USA
Abstract :
The authors approach the problem of obtaining the best query to select an arbitrarily given subset of a database. In particular, they are interested in automatically obtaining the best description for the function of a given protein sequence pattern. It is assumed that such a description is the query on the functional annotation of a protein sequence database having the closest extension in the database to the extension of the pattern. The implementation of an algorithm is presented for efficiently searching the query space when negation is not considered, and a method is developed in which such an implementation is used to search exhaustively a protein sequence database for biologically relevant protein sequence patterns.
Keywords :
database management systems; pattern recognition; proteins; query processing; biologically relevant protein sequence patterns; protein sequence database cross-field association system; protein sequence pattern; query space; Amino acids; Biological information theory; Computational biology; DNA; Databases; Knowledge acquisition; Molecular biophysics; Protein engineering; Protein sequence; Sequences;
Conference_Titel :
System Sciences, 1993, Proceeding of the Twenty-Sixth Hawaii International Conference on
Print_ISBN :
0-8186-3230-5
DOI :
10.1109/HICSS.1993.270608