• DocumentCode
    791170
  • Title

    A Three-State Model for DNA Protein-Coding Regions

  • Author

    Pinho, A.J. ; Neves, A.J.R. ; Afreixo, V. ; Bastos, C.A.C. ; Ferreira, P.J.S.G.

  • Author_Institution
    Signal Process. Lab., Aveiro Univ.
  • Volume
    53
  • Issue
    11
  • fYear
    2006
  • Firstpage
    2148
  • Lastpage
    2155
  • Abstract
    It is known that the protein-coding regions of DNA are usually characterized by a three-base periodicity. In this paper, we exploit this property, studying a DNA model based on three deterministic states, where each state implements a finite-context model. The experimental results obtained confirm the appropriateness of the proposed approach, showing compression gains in relation to the single finite-context model counterpart. Additionally, and potentially more interesting than the compression gain on its own, is the observation that the entropy associated to each of the three base positions of a codon differs and that this variation is not the same among the organisms analyzed
  • Keywords
    DNA; entropy; molecular biophysics; physiological models; proteins; DNA protein-coding regions; codon; compression gain; deterministic states; entropy; finite context model; three-base periodicity; three-state model; Amino acids; Bioinformatics; DNA; Data compression; Entropy; Genetics; Genomics; Proteins; Sequences; Signal processing; DNA compression; DNA sequence modeling; finite-context models; Algorithms; Base Sequence; Computer Simulation; DNA; Models, Genetic; Molecular Sequence Data; Open Reading Frames; Proteins; Sequence Alignment; Sequence Analysis, DNA;
  • fLanguage
    English
  • Journal_Title
    Biomedical Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9294
  • Type

    jour

  • DOI
    10.1109/TBME.2006.879477
  • Filename
    1710155