DocumentCode :
772792
Title :
A DSP Approach for Finding the Codon Bias in DNA Sequences
Author :
Tuqan, Jamal ; Rushdi, Ahmad
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of California, Davis, CA
Volume :
2
Issue :
3
fYear :
2008
fDate :
6/1/2008 12:00:00 AM
Firstpage :
343
Lastpage :
356
Abstract :
The detection of different forms of periodicities in DNA sequences has been an active area of research in recent years. Most of the signal processing based methods have primarily focussed on assigning numerical values to the symbolic DNA sequence and then applying spectral analysis tools such as the short-time discrete Fourier transform (ST-DFT) to locate these repeats. A key application of DNA periodicity finding has been in the identification of the protein coding regions in DNA sequences by tracking the so-called period-3 component using the DNA spectrum. The main problem with this gene detection approach is that it is successful for certain genes but does not work for others. An interesting open research problem is to therefore determine the underlying reasons behind this disparity in performance. This requires, in turn, a solid understanding of the working principles of the period-3 component and the DNA spectrum. In this paper, we present a DSP-based approach that provides a complete analysis of this phenomenon. Specifically, we derive a new DSP based model that 1) clearly explains the underlying mechanism of the period-3 component, 2) directly relates the identification of the period-3 component to the detection of nucleotide bias in the codon structure, and 3) completely characterizes the DNA spectrum by a set of numerical sequences termed the filtered polyphase sequences. Furthermore, by adhering to the specific structure of the derived model, we can show that standard signal processing tools such as digital filtering can substantially enhance the detection of the codon bias. Several performance measures of DNA periodicity detection are also proposed and experimental results are provided to illustrate the key findings of our work.
Keywords :
DNA; biology computing; filtering theory; genetics; molecular biophysics; molecular configurations; proteins; sequences; signal detection; spectral analysis; DNA periodicity finding; DNA spectrum; DSP; codon bias; codon structure; digital filtering; gene detection; nucleotide bias detection; period-3 component; polyphase sequences; protein coding regions; signal processing based methods; symbolic DNA sequences; DNA; Digital filters; Digital signal processing; Discrete Fourier transforms; Filtering; Proteins; Sequences; Signal processing; Solids; Spectral analysis; DNA periodicities; DNA sequence analysis; genomic signal processing; multirate DSP; period-3 component; tandem repeats;
fLanguage :
English
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
Publisher :
ieee
ISSN :
1932-4553
Type :
jour
DOI :
10.1109/JSTSP.2008.923851
Filename :
4550567
Link To Document :
بازگشت