• DocumentCode
    2943247
  • Title

    A modified stack decoder for protein secondary structure prediction

  • Author

    Aydin, Zafer ; Akgun, Toygar ; Altunbasak, Yucel

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    4
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    Secondary structure prediction is an important step in determining the structure and function of proteins. A fundamental assumption of current Bayesian secondary structure prediction methods is the conditional independence of residues which occur in distinct segments. This assumption enables the exact calculation of posterior probabilities by using pre-determined probabilistic models. However, this assumption is clearly violated in the case of protein sequences due to the existence of structural motifs which rely on sequentially distant segments interacting in three-dimensional space, including β-sheets. It has been suggested that the inability to capture such nonlocal interactions may be the main reason for the low accuracy typically achieved in β-strand prediction (Schmidler, S.C. et al., 2000; Frishman, D. and Argos, P., 1996). Furthermore, current Bayesian segmentations are based on maximum a posteriori or marginal posterior mode searches, which return a single segmentation that is optimal in some sense. We introduce a new secondary structure prediction method based on a modified version of the well-known stack decoder. The proposed method is an N-best search algorithm, which enables us to use the returned multiple segmentations to improve over a single segmentation. Also, due to the way the segmentations are constructed, it is possible to exploit the non-local interactions between β-strands in a sub-optimal way with the ultimate goal of increasing the overall prediction accuracy.
  • Keywords
    Bayes methods; decoding; hidden Markov models; medical computing; molecular configurations; prediction theory; probability; proteins; search problems; sequences; β-sheets; β-strand prediction; Bayesian methods; Bayesian segmentation; HMM; N-best search algorithm; amino acid sequence; beta-sheets; beta-strand prediction; marginal posterior mode searches; maximum a posteriori mode searches; posterior probability calculation; probabilistic models; protein secondary structure prediction; protein sequence analysis; sequentially distant segments; stack decoder; Accuracy; Amino acids; Bayesian methods; Decoding; Hidden Markov models; Hydrogen; Image processing; Prediction methods; Protein sequence; Signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1416114
  • Filename
    1416114