Title :
Detection and visualization of tandem repeats in DNA sequences
Author :
Buchner, Marc ; Janjarasjitt, Suparerk
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Case Western Reserve Univ., Cleveland, OH, USA
Abstract :
One conspicuous feature of DNA is the extent to which nucleotide subsequences repeat in the genome. Several strongly repetitive tandem (or contiguous) repeats are known to be associated with genetic diseases, while weaker repetitive structures are thought to be representative of historical events associated with sequence repetition. Thus, it is important to develop sensitive and rapid automation of the detection and identification of repeat sequences. A new algorithm for examining periodic patterns in DNA sequences is developed. The algorithm uses the short-time periodicity transform to compute the closest periodic sequence of fixed length at each nucleotide position in a given sequence to be analyzed. Each such subsequence is then compared to its closest periodic sequence to provide a quantitative measure of the amount of repetition within the sequence. In addition to being used to detect the presence of repeat subsequences in DNA, the periodicity explorer algorithm provides a potentially useful visualization of periodic patterns in a DNA sequence through a graphical display of the relative energy in the optimal periodic projections of the analyzed sequences, i.e., the DNA periodogram. Computationally, the algorithm is linear in the length of the analyzed sequence.
Keywords :
DNA; biological techniques; data visualisation; diseases; genetics; molecular biophysics; sequences; transforms; DNA periodogram; DNA sequences; contiguous repeats; genetic diseases; genome; graphical display; historical events; nucleotide; nucleotide subsequences; optimal periodic projections; periodic patterns visualization; periodic sequence; periodicity explorer algorithm; relative energy; repeat sequences detection automation; repeat sequences identification automation; repeat subsequences; short-time periodicity transform; tandem repeats detection; tandem repeats visualization; Algorithm design and analysis; Automation; Bioinformatics; DNA; Diseases; Displays; Genetics; Genomics; Sequences; Visualization;
Journal_Title :
Signal Processing, IEEE Transactions on
DOI :
10.1109/TSP.2003.815396