DocumentCode :
2943247
Title :
A modified stack decoder for protein secondary structure prediction
Author :
Aydin, Zafer ; Akgun, Toygar ; Altunbasak, Yucel
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
4
fYear :
2005
fDate :
18-23 March 2005
Abstract :
Secondary structure prediction is an important step in determining the structure and function of proteins. A fundamental assumption of current Bayesian secondary structure prediction methods is the conditional independence of residues which occur in distinct segments. This assumption enables the exact calculation of posterior probabilities by using pre-determined probabilistic models. However, this assumption is clearly violated in the case of protein sequences due to the existence of structural motifs which rely on sequentially distant segments interacting in three-dimensional space, including β-sheets. It has been suggested that the inability to capture such nonlocal interactions may be the main reason for the low accuracy typically achieved in β-strand prediction (Schmidler, S.C. et al., 2000; Frishman, D. and Argos, P., 1996). Furthermore, current Bayesian segmentations are based on maximum a posteriori or marginal posterior mode searches, which return a single segmentation that is optimal in some sense. We introduce a new secondary structure prediction method based on a modified version of the well-known stack decoder. The proposed method is an N-best search algorithm, which enables us to use the returned multiple segmentations to improve over a single segmentation. Also, due to the way the segmentations are constructed, it is possible to exploit the non-local interactions between β-strands in a sub-optimal way with the ultimate goal of increasing the overall prediction accuracy.
Keywords :
Bayes methods; decoding; hidden Markov models; medical computing; molecular configurations; prediction theory; probability; proteins; search problems; sequences; β-sheets; β-strand prediction; Bayesian methods; Bayesian segmentation; HMM; N-best search algorithm; amino acid sequence; beta-sheets; beta-strand prediction; marginal posterior mode searches; maximum a posteriori mode searches; posterior probability calculation; probabilistic models; protein secondary structure prediction; protein sequence analysis; sequentially distant segments; stack decoder; Accuracy; Amino acids; Bayesian methods; Decoding; Hidden Markov models; Hydrogen; Image processing; Prediction methods; Protein sequence; Signal processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8874-7
Type :
conf
DOI :
10.1109/ICASSP.2005.1416114
Filename :
1416114
Link To Document :
بازگشت