• DocumentCode
    463460
  • Title

    Protein Fold Recognition using Residue-Based Alignments of Sequence and Secondary Structure

  • Author

    Aydin, Zafer ; Erdogan, Hakan ; Altunbasak, Yucel

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA
  • Volume
    1
  • fYear
    2007
  • fDate
    15-20 April 2007
  • Abstract
    Protein structure prediction aims to determine the three-dimensional structure of proteins form their amino acid sequences. When a protein does not have similarity (homology) to any known fold, threading or fold recognition methods are used to predict structure. Fold recognition methods frequently employ secondary structure, solvent accessibility, and evolutionary information to enhance the accuracy and the quality of the predictions. In this paper, we present a residue based alignment method as an alternative to the state-of-the-art SSEA method, originally introduced by Przytycka et al., and further modified by McGuffin et al. We introduce a residue-based score function, which can incorporate amino acid similarity matrices such as BLOSUM into secondary structure similarity scoring and compute joint alignments. We show that the power of the SSEA method comes from the length normalization instead of the element alignment technique and similar performance can be achieved using residue-based alignments of secondary structures by optimizing gap costs. In simulations with the two benchmark datasets, our method performs slightly better than the SSEA in terms of the fold recognition accuracy. When the secondary structure similarity matrix is combined with the amino acid based BLOSUM30 matrix, the accuracy of our method improves further (4% for the McGuffin set and 10% for the Ding and Dubchak set). The availability of aligning the amino acid and secondary structure sequences in a joint manner offers a better starting point for more elaborate techniques that employ profile-profile alignments and machine learning methods.
  • Keywords
    medical image processing; pattern recognition; proteins; BLOSUM30 matrix; amino acid sequences; element alignment technique; machine learning methods; protein fold recognition; residue-based alignments; secondary structure; three-dimensional structure; Amino acids; Image processing; Image recognition; Learning systems; Libraries; Optimization methods; Protein engineering; Signal processing; Solvents; Target recognition; amino acid alignment; gap cost; protein fold recognition; score normalization; secondary structure alignment;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
  • Conference_Location
    Honolulu, HI
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0727-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2007.366688
  • Filename
    4217088