Title :
Fast Structural Similarity Search of Noncoding RNAs Based on Matched Filtering of Stem Patterns
Author :
Yoon, Byung-Jun ; Vaidyanathan, P.P.
Author_Institution :
California Inst. of Technol., Pasadena
Abstract :
Many noncoding RNAs (ncRNAs) have characteristic secondary structures that give rise to complicated base correlations in their primary sequences. Therefore, when performing an RNA similarity search to find new members of a ncRNA family, we need a statistical model - such as the profile- csHMM or the covariance model (CM) - that can effectively describe the correlations between distant bases. However, these models are computationally expensive, making the resulting RNA search very slow. To overcome this problem, various prescreening methods have been proposed that first use a simpler model to scan the database and filter out the dissimilar regions. Only the remaining regions that bear some similarity are passed to a more complex model for closer inspection. It has been shown that the prescreening approach can make the search speed significantly faster at no (or a slight) loss of prediction accuracy. In this paper, we propose a novel prescreening method based on matched filtering of stem patterns. Unlike many existing methods, the proposed method can prescreen the database solely based on structural similarity. The proposed method can handle RNAs with arbitrary secondary structures, and it can be easily incorporated into various search methods that use different statistical models. Furthermore, the proposed approach has a low computational cost, yet very effective for prescreening, as will be demonstrated in the paper.
Keywords :
biology computing; covariance analysis; macromolecules; organic compounds; search problems; RNA search; characteristic secondary structures; covariance model; fast structural similarity search; matched filtering; noncoding RNA; statistical model; stem patterns; Accuracy; Computational efficiency; Computational modeling; Databases; Filtering; Inspection; Matched filters; Pattern matching; RNA; Search methods;
Conference_Titel :
Signals, Systems and Computers, 2007. ACSSC 2007. Conference Record of the Forty-First Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4244-2109-1
Electronic_ISBN :
1058-6393
DOI :
10.1109/ACSSC.2007.4487161