DocumentCode :
2931382
Title :
Exons and introns characterization in nucleic acid sequences by time-frequency analysis
Author :
Melia, Umberto S P ; Clarià, Francesc ; Gallardo, Juan J. ; Caminal, Pere ; Perera, Alexandre ; Vallverdú, Montserrat
Author_Institution :
Dept. ESAII, Univ. Politec. de Catalunya, Barcelona, Spain
fYear :
2010
fDate :
Aug. 31 2010-Sept. 4 2010
Firstpage :
1783
Lastpage :
1786
Abstract :
A current problem in deoxyribonucleic acid (DNA) sequence analysis is to determine the exact locations of genes and, in eukaryotes, of the protein-coding regions in the mRNA primary transcript (pre-mRNA). Converting the symbols associated with the nucleotides of these sequences into discrete numerical values yields a signal that can be used to address problems of gene localization and annotation. In this work, thermodynamic data on the free energy changes (ΔG°) of DNA or RNA duplex formation are used to convert the symbols of the pre-mRNA nucleotide sequence into numerical values. This study presents an analysis of a large number of gene sequences, based on time-frequency representation techniques, in order to find pre-mRNA variables that best characterize and discriminate coding regions from non-coding regions. Instantaneous frequency variables and instantaneous spectral energy variables in different frequency bands allowed exons and introns to be correctly classified with an accuracy of more than 85%.
Keywords :
DNA; cellular biophysics; free energy; genetics; genomics; proteins; proteomics; time-frequency analysis; RNA DNA; deoxyribonucleic acid sequence analysis; duplex structure; eukaryotes; exons; free energy; gene sequences; instantaneous frequency variables; instantaneous spectral energy variables; introns; mRNA primary transcript; protein-coding regions; time-frequency analysis; Classification algorithms; DNA; Entropy; Proteins; RNA; Splicing; Time frequency analysis; Algorithms; Base Sequence; DNA; Exons; Introns; Molecular Sequence Data; Sequence Alignment; Sequence Analysis, DNA;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
Conference_Location :
Buenos Aires
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4123-5
Type :
conf
DOI :
10.1109/IEMBS.2010.5626756
Filename :
5626756