• DocumentCode
    768299
  • Title

    Detection and visualization of tandem repeats in DNA sequences

  • Author

    Buchner, Marc ; Janjarasjitt, Suparerk

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Sci., Case Western Reserve Univ., Cleveland, OH, USA
  • Volume
    51
  • Issue
    9
  • fYear
    2003
  • Firstpage
    2280
  • Lastpage
    2287
  • Abstract
    One conspicuous feature of DNA is the extent to which nucleotide subsequences repeat in the genome. Several strongly repetitive tandem (or contiguous) repeats are known to be associated with genetic diseases, while weaker repetitive structures are thought to be representative of historical events associated with sequence repetition. Thus, it is important to develop sensitive and rapid automation of the detection and identification of repeat sequences. A new algorithm for examining periodic patterns in DNA sequences is developed. The algorithm uses the short-time periodicity transform to compute the closest periodic sequence of fixed length at each nucleotide position in a given sequence to be analyzed. Each such subsequence is then compared to its closest periodic sequence to provide a quantitative measure of the amount of repetition within the sequence. In addition to being used to detect the presence of repeat subsequences in DNA, the periodicity explorer algorithm provides a potentially useful visualization of periodic patterns in a DNA sequence through a graphical display of the relative energy in the optimal periodic projections of the analyzed sequences, i.e., the DNA periodogram. Computationally, the algorithm is linear in the length of the analyzed sequence.
  • Keywords
    DNA; biological techniques; data visualisation; diseases; genetics; molecular biophysics; sequences; transforms; DNA periodogram; DNA sequences; contiguous repeats; genetic diseases; genome; graphical display; historical events; nucleotide; nucleotide subsequences; optimal periodic projections; periodic patterns visualization; periodic sequence; periodicity explorer algorithm; relative energy; repeat sequences detection automation; repeat sequences identification automation; repeat subsequences; short-time periodicity transform; tandem repeats detection; tandem repeats visualization; Algorithm design and analysis; Automation; Bioinformatics; DNA; Diseases; Displays; Genetics; Genomics; Sequences; Visualization;
  • fLanguage
    English
  • Journal_Title
    Signal Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1053-587X
  • Type

    jour

  • DOI
    10.1109/TSP.2003.815396
  • Filename
    1223540