DocumentCode :
1986914
Title :
Statistical inference for well-ordered structures in nucleotide sequences
Author :
Le, Shu-Yun ; Chen, Jih-H ; Maizel, Jacob V., Jr.
Author_Institution :
Nat. Cancer Inst., Nat. Inst. of Health, Frederick, MD, USA
fYear :
2003
fDate :
11-14 Aug. 2003
Firstpage :
190
Lastpage :
196
Abstract :
Distinct, local structures are frequently correlated with functional RNA elements involved in post-transcriptional regulation of gene expression. Discovery of microRNAs (miRNAs) suggests that there are a large class of small noncoding RNAs in eukaryotic genomes. These miRNAs have the potential to form distinct fold-back stem-loop structures. The prediction of those well-ordered folding sequences (WFS) in genomic sequences is very helpful for our understanding of RNA-based gene regulation and the determination of local RNA elements with structure-dependent functions. In this study, we describe a novel method for discovering the local WFS in a nucleotide sequence by Monte Carlo simulation and RNA folding. In the approach the quality of a local WFS is assessed by the energy difference (Ediff) between the optimal structure folded in the local segment and its corresponding optimal, restrained structure where all the previous base pairings formed in the optimal structure are prohibited. Distinct WFS can be discovered by scanning successive segments along a sequence for evaluating the difference between Ediff of the natural sequence and those computed from randomly shuffled sequences. Our results indicate that the statistically significant WFS detected in the genomic sequences of Caenorhabditis elegans (C.elegans) F49E12, T07C5, T07D1, T10H9, Y56A3A and Y71G12B are coincident with known fold-back stem-loops found in miRNA precursors. The potential and implications of our method in searching for miRNAs in genomes is discussed.
Keywords :
Monte Carlo methods; cellular biophysics; genetics; macromolecules; organic compounds; Caenorhabditis elegans genomic sequences; F49E12; Monte Carlo simulation; RNA folding; RNA-based gene regulation; T07C5; T07D1; T10H9; Y56A3A; Y71G12B; distinct fold-back stem-loop structures; energy difference; eukaryotic genomes; gene expression post-transcriptional regulation; local RNA elements; local structures; miRNA precursor; microRNA discovery; noncoding RNA; nucleotide sequences; optimal structure; randomly shuffled sequences; statistical inference; structure-dependent functions; well-ordered folding sequences; well-ordered structures; Bioinformatics; Biomedical computing; Cancer; Computational biology; Gene expression; Genomics; Laboratories; Proteins; RNA; Sequences;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN :
0-7695-2000-6
Type :
conf
DOI :
10.1109/CSB.2003.1227318
Filename :
1227318
Link To Document :
بازگشت