DocumentCode :
1588479
Title :
The Prediction of RNA Secondary Structure Using Multiple Unaligned Sequences
Author :
Fang, Xiaoyong ; Luo, Zhigang ; Yuan, Bo
Author_Institution :
Nat. Univ. of Defense Technol., Changsha
Volume :
2
fYear :
2007
Firstpage :
367
Lastpage :
374
Abstract :
Comparative analysis of homologous sequences has been used to predict RNA secondary structure. However, most existing comparative approaches are vulnerable to alignment errors and are therefore not well suited to practical application. Here we devise a new method for predicting RNA secondary structure from multiple unaligned sequences. The method completes the prediction in four major steps: 1) detect all possible stems in each sequence using a so-called position matrix, which indicates whether each position in the sequence is paired or unpaired; 2) find stems conserved across all sequences by multiplying the position matrices; 3) assess the conserved stems and select some of them as constraints for RNA folding; 4) perform the final structure prediction using RNAalifold, a popular program for secondary structure prediction. We tested our method on data sets composed of RNA sequences with known secondary structures. Its average accuracy is 73.21% for two-sequence tests, 74.18% for three-sequence tests, and 79.75% for four-sequence tests. The results show that our method predicts RNA secondary structure with higher accuracy than RNAalifold.
Keywords :
biology computing; macromolecules; matrix algebra; prediction theory; sequences; RNA secondary structure prediction; RNAalifold; comparative analysis; homologous sequences; multiple unaligned sequences; position matrix; stem detection; Biomedical informatics; Computer science; Databases; Educational institutions; Hidden Markov models; Public healthcare; RNA; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Natural Computation, 2007. ICNC 2007. Third International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2875-5
Type :
conf
DOI :
10.1109/ICNC.2007.733
Filename :
4344378