DocumentCode :
3244050
Title :
Fast Structural Similarity Search of Noncoding RNAs Based on Matched Filtering of Stem Patterns
Author :
Yoon, Byung-Jun ; Vaidyanathan, P.P.
Author_Institution :
California Inst. of Technol., Pasadena
fYear :
2007
fDate :
4-7 Nov. 2007
Firstpage :
44
Lastpage :
48
Abstract :
Many noncoding RNAs (ncRNAs) have characteristic secondary structures that give rise to complicated base correlations in their primary sequences. Therefore, when performing an RNA similarity search to find new members of a ncRNA family, we need a statistical model - such as the profile- csHMM or the covariance model (CM) - that can effectively describe the correlations between distant bases. However, these models are computationally expensive, making the resulting RNA search very slow. To overcome this problem, various prescreening methods have been proposed that first use a simpler model to scan the database and filter out the dissimilar regions. Only the remaining regions that bear some similarity are passed to a more complex model for closer inspection. It has been shown that the prescreening approach can make the search speed significantly faster at no (or a slight) loss of prediction accuracy. In this paper, we propose a novel prescreening method based on matched filtering of stem patterns. Unlike many existing methods, the proposed method can prescreen the database solely based on structural similarity. The proposed method can handle RNAs with arbitrary secondary structures, and it can be easily incorporated into various search methods that use different statistical models. Furthermore, the proposed approach has a low computational cost, yet very effective for prescreening, as will be demonstrated in the paper.
Keywords :
biology computing; covariance analysis; macromolecules; organic compounds; search problems; RNA search; characteristic secondary structures; covariance model; fast structural similarity search; matched filtering; noncoding RNA; statistical model; stem patterns; Accuracy; Computational efficiency; Computational modeling; Databases; Filtering; Inspection; Matched filters; Pattern matching; RNA; Search methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2007. ACSSC 2007. Conference Record of the Forty-First Asilomar Conference on
Conference_Location :
Pacific Grove, CA
ISSN :
1058-6393
Print_ISBN :
978-1-4244-2109-1
Electronic_ISBN :
1058-6393
Type :
conf
DOI :
10.1109/ACSSC.2007.4487161
Filename :
4487161
Link To Document :
بازگشت