Title :
Exploring Three-Base Periodicity for DNA Compression and Modeling
Author :
Ferreira, Paulo J S G ; Neves, António J R ; Afreixo, Vera ; Pinho, Armando J.
Author_Institution :
Signal Process. Lab., Aveiro Univ.
Abstract :
To explore the three-base periodicity often found in protein-coding DNA regions, we introduce a DNA model based on three deterministic states, where each state implements a finite-context model. The results obtained show compression gains in relation to the single finite-context model counterpart. Additionally, and potentially more interesting than the compression gain on its own, is the observation that the entropy associated to each of the three states differs and that this variation is not the same among the organisms analyzed
Keywords :
DNA; proteins; DNA compression; DNA modeling; compression gains; finite-context model; protein-coding DNA regions; three-base periodicity; Bioinformatics; Compression algorithms; DNA; Data compression; Entropy; Genomics; Organisms; Proteins; Sequences; Signal processing;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1661416