Title :
Implementation of a classification-based prediction model for plant mRNA Poly(A) sites
Author :
Ji, Guoli ; Wu, Xiaohui ; Huang, Jiangyin ; Li, Qingshun Quinn
Author_Institution :
Dept. of Autom., Xiamen Univ., Xiamen
fDate :
Sept. 28 2008-Oct. 1 2008
Abstract :
The poly(A) site of a messenger RNA (mRNA) defines the end of a transcript during eukaryotic gene expression. Finding poly(A) sites in genome sequences can help to annotate the ends of genes and predict alternative polyadenylation. However, it is challenging to predict plant poly(A) sites using computational methods because of the weak signals that determine the poly(A) sites. Here we describe a classification based plant poly(A) site recognition model. First, several feature representation methods like factorial moments, M encoding, and weight of signal patterns are adopted to describe the makeup of nucleotide sequences of poly(A) signals. Then, a training model using different classification algorithms like Bayesian network is built as a testing model to predict plant mRNA poly(A) sites. Comparing to previous plant poly(A) sites prediction software PASS that we developed, the recognition model introduced here has better performance, flexibility and expansibility.
Keywords :
belief networks; biology computing; genetics; macromolecules; molecular biophysics; pattern classification; Bayesian network; M encoding; classification-based prediction model; eukaryotic gene expression; factorial moment; feature representation; genome sequence; messenger RNA; nucleotide sequence; poly(A) site; polyadenylation; recognition model; signal pattern; Bayesian methods; Bioinformatics; Classification algorithms; Encoding; Gene expression; Genomics; Predictive models; RNA; Software performance; Testing;
Conference_Titel :
Bio-Inspired Computing: Theories and Applications, 2008. BICTA 2008. 3rd International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
978-1-4244-2724-6
DOI :
10.1109/BICTA.2008.4656716