DocumentCode
65360
Title
Chain-RNA: A Comparative ncRNA Search Tool Based on the Two-Dimensional Chain Algorithm
Author
Jikai Lei ; Techa-angkoon, Prapaporn ; Yanni Sun
Author_Institution
Dept. of Comput. Sci. & Eng., Michigan State Univ., East Lansing, MI, USA
Volume
10
Issue
2
fYear
2013
fDate
March-April 2013
Firstpage
274
Lastpage
285
Abstract
Noncoding RNA (ncRNA) identification is highly important to modern biology. The state-of-the-art method for ncRNA identification is based on comparative genomics, in which evolutionary conservations of sequences and secondary structures provide important evidence for ncRNA search. For ncRNAs with low sequence conservation but high structural similarity, conventional local alignment tools such as BLAST yield low sensitivity. Thus, there is a need for ncRNA search methods that can incorporate both sequence and structural similarities. We introduce chain-RNA, a pairwise structural alignment tool that can effectively locate cross-species conserved RNA elements with low sequence similarity. In chain-RNA, stem-loop structures are extracted from dot plots generated by an efficient local-folding algorithm. Then, we formulate stem alignment as an extended 2D chain problem and employ existing chain algorithms. Chain-RNA is tested on a data set containing annotated ncRNA homologs and is applied to novel ncRNA search in a transcriptomic data set. The experimental results show that chain-RNA has better tradeoff between sensitivity and false positive rate in ncRNA prediction than conventional sequence similarity search tools and is more time efficient than structural alignment tools. The source codes of chain-RNA can be downloaded at http://sourceforge.net/projects/chain-rna/ or at http://www.cse.msu.edu/~leijikai/chain-rna/.
Keywords
RNA; biology computing; molecular biophysics; molecular configurations; BLAST; chain-RNA; local-folding algorithm; low sequence similarity; ncRNA search tool; pairwise structural alignment tool; structural similarity; transcriptomic data set; two-dimensional chain algorithm; Chain algorithms; RNA; Noncoding RNA search; chain algorithm; secondary structures; structural alignment; Algorithms; Computational Biology; Databases, Genetic; Genome, Bacterial; Models, Genetic; Nucleic Acid Conformation; RNA, Untranslated; ROC Curve; Sequence Alignment; Sequence Analysis, RNA;
fLanguage
English
Journal_Title
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1545-5963
Type
jour
DOI
10.1109/TCBB.2012.137
Filename
6342939
Link To Document