Title :
Numerical representation of DNA sequences based on genetic code context and its applications in periodicity analysis of genomes
Author :
Yin, Changchuan ; Yau, Stephen S -T
Author_Institution :
Dept. of Math., Stat. & Comput. Sci., Univ. of Illinois at Chicago, Chicago, IL
Abstract :
The indispensable prerequisites in characterizing information content of DNA molecules by computational methods are the numerical representations of symbolic DNA sequences. Current numerical representation methods for DNA sequences do not contain the genetic code context information, which may play an important role in defining protein coding regions. We propose a novel numerical representation of DNA sequences based on genetic code context within DNA sequences and explore the feasibility of applying this method to identify protein coding regions in genomes. Computational experiments indicate that incorporating genetic code information into numerical representations is a promising approach in which DNA sequences are uniquely represented and more information is represented so that digital processing tools can be applied to the periodicity analysis in DNA sequences effectively.
Keywords :
DNA; biochemistry; bioinformatics; genomics; molecular biophysics; molecular configurations; proteins; signal processing; DNA molecules; biochemical properties; computational methods; digital signal processing tools; genetic code context information; genomes; numerical representation; periodicity analysis; protein coding regions; symbolic DNA sequences; Bioinformatics; DNA computing; Discrete Fourier transforms; Genetics; Genomics; Humans; Proteins; Sequences; Signal processing; Signal processing algorithms;
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology, 2008. CIBCB '08. IEEE Symposium on
Conference_Location :
Sun Valley, ID
Print_ISBN :
978-1-4244-1778-0
Electronic_ISBN :
978-1-4244-1779-7
DOI :
10.1109/CIBCB.2008.4675783