• DocumentCode
    2855711
  • Title

    Information divergence measures-for detection of borders between coding and noncoding DNA regions using recursive entropic segmentation

  • Author

    Nicorici, Daniel ; Astola, Jaakko

  • Author_Institution
    Tampere Int. Center for Signal Process., Tampere Univ. of Technol., Finland
  • fYear
    2003
  • fDate
    28 Sept.-1 Oct. 2003
  • Firstpage
    577
  • Lastpage
    580
  • Abstract
    Entropy-based divergence measures have shown promising results in many areas of engineering and image processing. In this study, we use the Jensen-Shannon and Jensen-Renyi divergence measures for recursive segmentation of DNA sequences in order to find borders between coding and noncoding regions. Heterogeneous DNA sequences that are comprised of the four nucleotides A, C, G, and T and the stop codons can be partitioned into homogeneous domains. We introduce a new 18 symbol alphabet that captures: (i) the differential base composition in codons, and (ii) the differential stop codon composition along three phases in both DNA strands. For two entire genomes of bacteria our results obtained using the new approach, based on Jensen-Renyi divergence and the new 18 symbol alphabet, are more accurate than those obtained using the standard approach, based on Jensen-Shannon divergence, when searching for borders between coding and noncoding DNA regions.
  • Keywords
    DNA; entropy; image coding; image segmentation; medical image processing; Jensen-Renyi divergence; Jensen-Shannon divergence; borders detection; differential stop codon composition; entropic segmentation; entropy-based divergence measures; heterogeneous DNA sequences; image processing; noncoding DNA regions; Area measurement; Bioinformatics; DNA; Entropy; Genomics; Image coding; Image segmentation; Microorganisms; Sequences; Signal processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Statistical Signal Processing, 2003 IEEE Workshop on
  • Print_ISBN
    0-7803-7997-7
  • Type

    conf

  • DOI
    10.1109/SSP.2003.1289538
  • Filename
    1289538