Title :
Understanding Sequence Variability of RNA Motifs Using Geometric Search and IsoDiscrepancy Matrices
Author :
Petrov, Anton I. ; Stombaugh, Jesse ; Zirbel, Craig L. ; Leontis, Neocles B.
Author_Institution :
Bowling Green State Univ., Bowling Green, OH, USA
Abstract :
Many of the nominally single-stranded hairpin, internal, and junction ldquolooprdquo regions of RNA secondary structures, in fact, form uniquely folded 3D motifs. These elements are largely structured by non-Watson-Crick basepairs. Many 3D motifs are recurrent, meaning they occur in different RNAs. Recurrent motifs have the same 3D structure but not necessarily the same sequence. We describe a methodology for identifying the sequence variability of a given recurrent RNA internal loop that can be generalized to hairpin and junction loops. Since the database of RNA 3D structures now contains a significant number of biologically active, structured RNAs, including ribosomal RNAs, ribozymes, and riboswitches, we can directly observe some of the sequence variability for recurrent motifs in x-ray crystal structures. We use our search program, FR3D, to search the 3D structure database for geometrically similar motif instances that share the same spatial pattern of basepairs. We apply our analysis of RNA basepair isostericity and occurrence frequencies to suggest likely basepair substitutions. We use the IsoDiscrepancy Index (IDI), which we recently introduced to quantify basepair isostericities, to derive 4x4 IDI Tables for each base combination in each basepair family. We illustrate how these tables can be applied to predict the most likely base substitutions that occur in a 3D motif. By comparing observed motif instances, we also determine the most likely locations of inserted ("bulged") nucleotides. We compare the predictions from these considerations to observed variability in multiple sequence alignments of the motif.
Keywords :
crystal structure; enzymes; genomics; macromolecules; molecular biophysics; molecular configurations; IsoDiscrepancy Index; Isodiscrepancy matrices; RNA 3D structures; RNA basepair isostericity; RNA motif sequence variability; X-ray crystal structures; hairpin loop; junction loop; multiple sequence alignments; nucleotides; recurrent RNA internal loop; ribosomal RNA; riboswitches; ribozymes; Bioinformatics; Biological information theory; Collaboration; Crystallography; Frequency; Genomics; Performance analysis; Predictive models; RNA; Spatial databases; Find RNA 3D (FR3D); IsoDiscrepancy Index (IDI); RNA 3D Motif; RNA Base-Phosphate Interactions (BPh); RNA Basepairs;
Conference_Titel :
Bioinformatics, 2009. OCCBIO '09. Ohio Collaborative Conference on
Conference_Location :
Cleveland, OH
Print_ISBN :
978-0-7695-3685-9
DOI :
10.1109/OCCBIO.2009.15