• DocumentCode
    2931382
  • Title

    Exons and introns characterization in nucleic acid sequences by time-frequency analysis

  • Author

    Melia, Umberto S P ; Clarià, Francesc ; Gallardo, Juan J. ; Caminal, Pere ; Perera, Alexandre ; Vallverdú, Montserrat

  • Author_Institution
    Dept. ESAII, Univ. Politec. de Catalunya, Barcelona, Spain
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 4 2010
  • Firstpage
    1783
  • Lastpage
    1786
  • Abstract
    A current problem in deoxyribonucleic acid (DNA) sequence analysis is determining the exact locations of genes and, in eukaryotes, of the protein-coding regions in the mRNA primary transcript (pre-mRNA). Converting the symbols associated with the nucleotides of these sequences into discrete numerical values yields a signal that can be used to address problems of gene localization and annotation. In this work, thermodynamic data on the free energy changes (ΔG°) of DNA or RNA duplex formation are used to convert the symbols of the pre-mRNA nucleotide sequence into numerical values. This study presents an analysis of a large number of gene sequences, based on time-frequency representation techniques, in order to find pre-mRNA variables that best characterize coding regions and discriminate them from non-coding regions. Instantaneous frequency variables and instantaneous spectral energy variables in different frequency bands allowed exons and introns to be correctly classified with an accuracy of more than 85%.
  • Keywords
    DNA; cellular biophysics; free energy; genetics; genomics; proteins; proteomics; time-frequency analysis; RNA DNA; deoxyribonucleic acid sequence analysis; duplex structure; eukaryotes; exons; free energy; gene sequences; instantaneous frequency variables; instantaneous spectral energy variables; introns; mRNA primary transcript; protein-coding regions; time-frequency analysis; Classification algorithms; DNA; Entropy; Proteins; RNA; Splicing; Time frequency analysis; Algorithms; Base Sequence; DNA; Exons; Introns; Molecular Sequence Data; Sequence Alignment; Sequence Analysis, DNA;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
  • Conference_Location
    Buenos Aires
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4123-5
  • Type

    conf

  • DOI
    10.1109/IEMBS.2010.5626756
  • Filename
    5626756
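
The pipeline outlined in the abstract (symbols to numbers, then a time-frequency representation, then per-frame frequency and band-energy variables) can be illustrated with a minimal sketch. Everything specific here is an assumption and not taken from the paper: the dinucleotide values in PLACEHOLDER_DG are made-up placeholders rather than real ΔG° data, the short-time Fourier transform stands in for whichever time-frequency representation the authors used, the spectral centroid is only one common estimate of instantaneous frequency, and all function names, window sizes, and band limits are hypothetical.

```python
import numpy as np

# HYPOTHETICAL placeholder values for illustration only. They are NOT
# measured free energies and NOT the thermodynamic table used in the paper.
PLACEHOLDER_DG = {a + b: -1.0 - 0.5 * ((a in "GC") + (b in "GC"))
                  for a in "ACGU" for b in "ACGU"}


def sequence_to_signal(seq):
    """Map each overlapping dinucleotide to a number (length len(seq) - 1)."""
    seq = seq.upper().replace("T", "U")
    return np.array([PLACEHOLDER_DG[seq[i:i + 2]] for i in range(len(seq) - 1)])


def spectrogram(x, win=32, hop=8):
    """Squared-magnitude short-time Fourier transform with a Hann window."""
    x = x - x.mean()  # remove the DC offset before windowing
    w = np.hanning(win)
    starts = range(0, len(x) - win + 1, hop)
    frames = np.array([x[s:s + win] * w for s in starts])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    freqs = np.fft.rfftfreq(win)  # cycles per nucleotide, from 0 to 0.5
    return freqs, power


def mean_frequency(freqs, power):
    """Per-frame spectral centroid, used here as an instantaneous-frequency estimate."""
    total = power.sum(axis=1)
    total[total == 0] = 1.0  # silent frames get a mean frequency of 0
    return (power * freqs).sum(axis=1) / total


def band_energy(freqs, power, lo, hi):
    """Per-frame spectral energy in the band lo <= f < hi."""
    mask = (freqs >= lo) & (freqs < hi)
    return power[:, mask].sum(axis=1)


# Usage on an arbitrary toy sequence (not real gene data).
toy = "AUGGCCAUUGUAAUGGGCCGCUGAAAGGGUGCCCGAUAG" * 4
freqs, power = spectrogram(sequence_to_signal(toy))
features = np.column_stack([
    mean_frequency(freqs, power),
    band_energy(freqs, power, 0.0, 0.25),
    band_energy(freqs, power, 0.25, 0.5),
])
```

Each row of features describes one analysis window. In a setup like the one the abstract describes, such rows would be computed for windows labeled as exon or intron and passed to a classifier, which this sketch does not include.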