Title :
Chain-RNA: A Comparative ncRNA Search Tool Based on the Two-Dimensional Chain Algorithm
Author :
Jikai Lei ; Techa-angkoon, Prapaporn ; Yanni Sun
Author_Institution :
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
Abstract :
Noncoding RNA (ncRNA) identification is highly important to modern biology. The state-of-the-art method for ncRNA identification is based on comparative genomics, in which evolutionary conservations of sequences and secondary structures provide important evidence for ncRNA search. For ncRNAs with low sequence conservation but high structural similarity, conventional local alignment tools such as BLAST yield low sensitivity. Thus, there is a need for ncRNA search methods that can incorporate both sequence and structural similarities. We introduce chain-RNA, a pairwise structural alignment tool that can effectively locate cross-species conserved RNA elements with low sequence similarity. In chain-RNA, stem-loop structures are extracted from dot plots generated by an efficient local-folding algorithm. Then, we formulate stem alignment as an extended 2D chain problem and employ existing chain algorithms. Chain-RNA is tested on a data set containing annotated ncRNA homologs and is applied to novel ncRNA search in a transcriptomic data set. The experimental results show that chain-RNA has better tradeoff between sensitivity and false positive rate in ncRNA prediction than conventional sequence similarity search tools and is more time efficient than structural alignment tools. The source codes of chain-RNA can be downloaded at http://sourceforge.net/projects/chain-rna/ or at http://www.cse.msu.edu/~leijikai/chain-rna/.
Keywords :
RNA; biology computing; molecular biophysics; molecular configurations; BLAST; chain-RNA; local-folding algorithm; low sequence similarity; ncRNA search tool; pairwise structural alignment tool; structural similarity; transcriptomic data set; two-dimensional chain algorithm; Chain algorithms; RNA; Noncoding RNA search; chain algorithm; secondary structures; structural alignment; Algorithms; Computational Biology; Databases, Genetic; Genome, Bacterial; Models, Genetic; Nucleic Acid Conformation; RNA, Untranslated; ROC Curve; Sequence Alignment; Sequence Analysis, RNA;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2012.137