Title :
Improvement of the inside-outside algorithm using prediction and application to RNA modeling
Author :
Chen, Jin-miao ; Chaudhari, Narendra S.
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
Abstract :
In the last decade, stochastic grammars became an important probabilistic tool for modeling biological sequences. The inside-outside (IO) algorithm is the most popular estimator for learning the probability parameters associated to the rules from training sequences. The inside-outside algorithm needs O(L3M3) (L is the length of training sequence and M is the number of non-terminals) operations. The IO algorithm considers a number of useless inside variables (α-variables) and useless outside variables (β-variables) in each training iteration. In this paper we give a method to avoid these useless variables. Our method uses a prediction function for this purpose. We give an example based on RNA sequences (taken from [Y. Sakakibara, et al., 1994]) to illustrate the percentage of such useless variables avoided in our method.
Keywords :
context-free grammars; macromolecules; parameter estimation; prediction theory; probability; stochastic processes; RNA; biological sequence modeling; inside-outside algorithm; parameter estimation; prediction; probabilistic tool; probability parameter; stochastic grammars; Application software; Biological system modeling; Hidden Markov models; Inference algorithms; Prediction algorithms; Predictive models; Probability; RNA; Sequences; Stochastic processes;
Conference_Titel :
Machine Learning and Cybernetics, 2003 International Conference on
Print_ISBN :
0-7803-8131-9
DOI :
10.1109/ICMLC.2003.1259882