Title :
Protein secondary structure prediction with semi Markov HMMs
Author :
Z. Aydin;Y. Altunbasak;M. Borodovsky
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
fDate :
6/26/1905 12:00:00 AM
Abstract :
Secondary structure prediction has been an essential task in determining the structure and function of proteins. Prediction accuracy is improving every year towards the estimated 88% theoretical limit (Rost. B., http://cubic.bioc.columbia.edu/papers/2002 rev dekker/paper.html). There are two approaches for the secondary structure prediction. The first one, ab initio (single sequence) prediction, does not use any homology information. The evolutionary information, if available, is used by the second approach to improve the prediction accuracy by a few percentages (Schmidler, S.C. et al., J. Computational Biology, vol.7, no.1/2, p.233-48, 2000). We address the problem of single sequence prediction by developing a semi Markov HMM, similar to the one proposed by Schmidler et al. We introduce a better dependency model by considering the statistically significant amino acid correlation patterns at segment borders. Also, we propose an internal dependency model considering right to left dependencies without modifying the left to right HMM topology. In addition, we propose an iterative training method to estimate the HMM parameters better. Putting all these together, we obtained 1.5% improvement in three-state-per-residue accuracy.
Keywords :
"Proteins","Hidden Markov models","Accuracy","Sequences","Estimation theory","Computational biology","Biological system modeling","Amino acids","Topology","Iterative methods"
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP ´04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1327176