Title :
Identification of plant messenger RNA polyadenylation sites using length-variable second order Markov model
Author :
Ji, Guoli ; Zhang, Huanghui ; Wu, Xiaohui ; Tang, Meishuang
Author_Institution :
Dept. of Autom., Xiamen Univ., Xiamen, China
Abstract :
In this paper we adopted a length-variable second order Markov model to identify plant messenger RNA poly(A) sites, and provided a common method that only relies on the experimental sequences. The efficacy of our model is showed up to 92% sensitivity and 79% specificity. This method is particularly suitable for the prediction of the poly(A) site which is lack of biological priori knowledge and has poor conservative signal characteristic, as well as for the identification of the alternative poly(A) sites in different genetic regions. Compared with other algorithms, generalized hidden Markov model needed the signal distributions and AdaBoost required the construction of signal features around the sites, our model is more versatile.
Keywords :
Markov processes; RNA; bioelectric phenomena; botany; genetics; molecular biophysics; biological priori knowledge; conservative signal characteristics; genetic regions; length-variable second order Markov model; plant messenger RNA poly(A) site identification; plant messenger RNA polyadenylation site identification; Bioinformatics; Biological system modeling; Hidden Markov models; Markov processes; Predictive models; RNA; Tin; biological priori knowledge; length-variable second order Markov model; polyadenylation site;
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2011 IEEE International Conference on
Conference_Location :
Anchorage, AK
Print_ISBN :
978-1-4577-0652-3
DOI :
10.1109/ICSMC.2011.6083769