DocumentCode
2855711
Title
Information divergence measures-for detection of borders between coding and noncoding DNA regions using recursive entropic segmentation
Author
Nicorici, Daniel ; Astola, Jaakko
Author_Institution
Tampere Int. Center for Signal Process., Tampere Univ. of Technol., Finland
fYear
2003
fDate
28 Sept.-1 Oct. 2003
Firstpage
577
Lastpage
580
Abstract
Entropy-based divergence measures have shown promising results in many areas of engineering and image processing. In this study, we use the Jensen-Shannon and Jensen-Renyi divergence measures for recursive segmentation of DNA sequences in order to find borders between coding and noncoding regions. Heterogeneous DNA sequences that are comprised of the four nucleotides A, C, G, and T and the stop codons can be partitioned into homogeneous domains. We introduce a new 18 symbol alphabet that captures: (i) the differential base composition in codons, and (ii) the differential stop codon composition along three phases in both DNA strands. For two entire genomes of bacteria our results obtained using the new approach, based on Jensen-Renyi divergence and the new 18 symbol alphabet, are more accurate than those obtained using the standard approach, based on Jensen-Shannon divergence, when searching for borders between coding and noncoding DNA regions.
Keywords
DNA; entropy; image coding; image segmentation; medical image processing; Jensen-Renyi divergence; Jensen-Shannon divergence; borders detection; differential stop codon composition; entropic segmentation; entropy-based divergence measures; heterogeneous DNA sequences; image processing; noncoding DNA regions; Area measurement; Bioinformatics; DNA; Entropy; Genomics; Image coding; Image segmentation; Microorganisms; Sequences; Signal processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Statistical Signal Processing, 2003 IEEE Workshop on
Print_ISBN
0-7803-7997-7
Type
conf
DOI
10.1109/SSP.2003.1289538
Filename
1289538
Link To Document