Title :
Protein secondary structure prediction with Bayesian learning method
Author :
Wang, Peng-Liang ; Zhang, El
Author_Institution :
Dept. of Comput. Sci., California State Univ., Sacramento, CA, USA
Abstract :
This paper describes a Bayesian learning based approach to protein secondary structure prediction. Four secondary structure types are considered, including α-helix, β-strand, β-turn and coil. A six-letter exchange group is utilized to represent a protein sequence. Training cases are expressed as sequence quaternion. A tool called Predictor is developed in Java that implements the proposed approach. To evaluate the tool, we select, from the protein data bank and based on the principle of one-protein-per-family according to the structure family of SCOP, six hundred and twenty-three known proteins without pair wise sequence homology. Several training/test data splits have been tried. The results show that our proposed approach can produce prediction accuracy comparable to those of the traditional prediction methods. Predictor has user-friendly and easy-to-use GUIs, and is of practical value to the molecular biology researchers.
Keywords :
Bayes methods; chemical structure; graphical user interfaces; learning (artificial intelligence); macromolecules; proteins; α-helix; β-strand; β-turn; Bayesian learning method; Java; Predictor; SCOP structure family; coil; data splits; easy-to-use GUI; molecular biology; pair wise sequence homology; pairwise sequence homology; protein data bank; protein secondary structure prediction; protein sequence; secondary structure types; sequence quaternion; six-letter exchange group; user-friendly GUI; Accuracy; Bayesian methods; Coils; Identity-based encryption; Learning systems; Prediction methods; Protein engineering; Protein sequence; Quaternions; Statistics;
Conference_Titel :
Tools with Artificial Intelligence, 2002. (ICTAI 2002). Proceedings. 14th IEEE International Conference on
Print_ISBN :
0-7695-1849-4
DOI :
10.1109/TAI.2002.1180812