DocumentCode :
1815095
Title :
Tree decomposition based fast search of RNA structures including pseudoknots in genomes
Author :
Song, Yinglei ; Liu, Chunmei ; Malmberg, Russell ; Pan, Fabgfang ; Cai, Liming
Author_Institution :
Dept. of Comput. Sci., Georgia Univ., Athens, GA, USA
fYear :
2005
fDate :
8-11 Aug. 2005
Firstpage :
223
Lastpage :
234
Abstract :
Searching genomes for RNA secondary structure with computational methods has become an important approach to the annotation of non-coding RNAs. However, due to the lack of efficient algorithms for accurate RNA structure-sequence alignment, computer programs capable of fast and effectively searching genomes for RNA secondary structures have not been available. In this paper, a novel RNA structure profiling model is introduced based on the notion of a conformational graph to specify the consensus structure of an RNA family. Tree decomposition yields a small tree width t for such conformation graphs (e.g., t=2 for stem loops and only a slight increase for pseudo-knots). Within this modelling framework, the optimal alignment of a sequence to the structure model corresponds to finding a maximum valued isomorphic subgraph and consequently can be accomplished through dynamic programming on the tree decomposition of the conformational graph in time O(ktN2), where k is a small parameter, and N is the size of the profiled RNA structure. Experiments show that the application of the alignment algorithm to search in genomes yields the same search accuracy as methods based on a covariance model with a significant reduction in computation time. In particular, very accurate searches of tmRNAs in bacteria genomes and of telomerase RNAs in yeast genomes can be accomplished in days, as opposed to months required by other methods. The tree decomposition based searching tool is free upon request and can be downloaded at our site http://www.uga.edu/RNA-Informatics/software/index.php.
Keywords :
biology computing; dynamic programming; genetics; macromolecules; matrix decomposition; microorganisms; molecular biophysics; organic compounds; trees (mathematics); RNA structure profiling model; RNA structure-sequence alignment; bacteria genome; computational method; conformational graph; covariance model; dynamic programming; genome searching; noncoding RNA; pseudoknots; telomerase RNA; tmRNA; tree decomposition; yeast genome; Bioinformatics; Biology computing; Computational biology; Computer science; Genomics; Plants (biology); RNA; Sequences; Stochastic systems; Tree graphs; Covariance model; Pseudo-knot search; RNA secondary structure profiling; Tree decomposition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Systems Bioinformatics Conference, 2005. Proceedings. 2005 IEEE
Print_ISBN :
0-7695-2344-7
Type :
conf
DOI :
10.1109/CSB.2005.52
Filename :
1498024
Link To Document :
بازگشت