Title :
The effects of C-G content and mutations on the Fourier transform method for periodicity
Author :
Garbarine, Elaine M. ; Rosen, Gail L.
Author_Institution :
Sch. of Electr. & Comput. Eng., Drexel Univ., Philadelphia, PA
Abstract :
It is well-known that the Fourier spectrum of a DNA protein-coding region exhibits an f = 1/3 peak. This is due to an unbalanced nucleotide distribution and open reading frame (ORF) positional bias that introduces a 3-base periodicity into the sequence. Until now, the f = 1/3 property has mainly been used to detect protein-coding regions, but in our paper, we use the f = 1/3 spectral height to detect the strength of the periodicity in these regions. First, we quantify the effects of sequence length, nucleotide distribution, ORF positional bias, and mutations on this spectral measure. Then once we know how the spectral peak height changes in response to these aberrations, we investigate a sliding window technique in an attempt to detect changes in CG content. Detection of frameshift mutations are important for finding disease, and it has been hypothesized that nucleotide bias variation signifies a change in the function of encoded proteins.
Keywords :
DNA; Fourier transform spectra; molecular biophysics; molecular configurations; spectral analysis; statistical analysis; C-G content effects; C-G mutation effects; DNA protein coding region Fourier spectrum; DNA sequence periodicity; Fourier spectral height; Fourier transform method; ORF positional bias; open reading frame; periodicity analysis; periodicity strength detection; sequence length; spectral peak height; unbalanced nucleotide distribution; Character generation; DNA; Diseases; Fourier transforms; Genetic mutations; Length measurement; Nuclear measurements; Position measurement; Proteins; Sequences;
Conference_Titel :
Genomic Signal Processing and Statistics, 2008. GENSiPS 2008. IEEE International Workshop on
Conference_Location :
Phoenix, AZ
Print_ISBN :
978-1-4244-2371-2
Electronic_ISBN :
978-1-4244-2372-9
DOI :
10.1109/GENSIPS.2008.4555668