DocumentCode
2931382
Title
Exons and introns characterization in nucleic acid sequences by time-frequency analysis
Author
Melia, Umberto S P ; Clarià, Francesc ; Gallardo, Juan J. ; Caminal, Pere ; Perera, Alexandre ; Vallverdú, Montserrat
Author_Institution
Dept. ESAII, Univ. Politec. de Catalunya, Barcelona, Spain
fYear
2010
fDate
Aug. 31 2010-Sept. 4 2010
Firstpage
1783
Lastpage
1786
Abstract
A current problem in deoxyribonucleic acid (DNA) sequence analysis is to determine the exact locations of the genes and also in eukaryotes, the protein-coding regions in the mRNA primary transcript (pre-mRNA).The conversion into discrete numerical values of the symbols associated to the nucleotides of these sequences allows for a signal to address the problems related to localization and annotation of genes. In this work, thermodynamic data of free energy changes (ΔG°) on the formation of a duplex structure of DNA or RNA are used to convert the symbols into numerical values associated with the nucleotide sequence pre-mRNA. This study presents an analysis, based on techniques of time-frequency representation of a large number of gene sequences, in order to find variables related to pre-mRNA that could best characterize and discriminate coding regions from non-coding regions. It has been found that instantaneous frequency variables and instantaneous spectral energy variables in different frequency bands, allowed exons and introns to be correctly classified with more than 85%.
Keywords
DNA; cellular biophysics; free energy; genetics; genomics; proteins; proteomics; time-frequency analysis; RNA DNA; deoxyribonucleic acid sequence analysis; duplex structure; eukaryotes; exons; free energy; gene sequences; instantaneous frequency variables; instantaneous spectral energy variables; introns; mRNA primary transcript; protein-coding regions; time-frequency analysis; Classification algorithms; DNA; Entropy; Proteins; RNA; Splicing; Time frequency analysis; Algorithms; Base Sequence; DNA; Exons; Introns; Molecular Sequence Data; Sequence Alignment; Sequence Analysis, DNA;
fLanguage
English
Publisher
ieee
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
Conference_Location
Buenos Aires
ISSN
1557-170X
Print_ISBN
978-1-4244-4123-5
Type
conf
DOI
10.1109/IEMBS.2010.5626756
Filename
5626756
Link To Document