Title :
Fold recognition using sequence fingerprints of protein local substructures
Author :
Kryshtafovych, Andriy ; Hvidsten, Torgeir R. ; Komorowski, Jan ; Fidelis, Krzysztof
Author_Institution :
Lawrence Livermore Nat. Lab., Berkeley, CA, USA
Abstract :
A protein local substructure (descriptor) is a set of several short nonoverlapping fragments of the polypeptide chain. Each substructure describes local environment of a particular residue and includes only those segments of the main chain that are located in the proximity of that residue. Similar descriptors from the representative set of proteins were analyzed to reveal links between the substructures and the sequences of their segments. Using the detected sequence-based fingerprints, specific geometrical conformations are assigned to new sequences. The ability of the approach to recognize correct SCOP folds was tested on 273 sequences from the 49 most popular folds. Good predictions were obtained in 85% of cases. No performance drop was observed with decreasing sequence similarity between target sequences and sequences from the training set of proteins.
Keywords :
biology computing; pattern recognition; proteins; fold recognition; polypeptide chain; protein local substructure; sequence-based fingerprints; Amino acids; Assembly; Bioinformatics; Fingerprint recognition; Laboratories; Libraries; Prediction methods; Protein engineering; Shape; Testing;
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
DOI :
10.1109/CSB.2003.1227393