DocumentCode
147111
Title
A Conditional Compression Distance that Unveils Insights of the Genomic Evolution
Author
Pratas, Diogo ; Pinho, Armando J.
Author_Institution
Dept. of Electron., Telecommun. & Inf., Univ. of Aveiro, Aveiro, Portugal
fYear
2014
fDate
26-28 March 2014
Firstpage
421
Lastpage
421
Abstract
We describe a compression-based distance for genomic sequences. Instead of using the usual conjoint information content, as in the classical Normalized Compression Distance (NCD), it uses the conditional information content. To compute this Normalized Conditional Compression Distance (NCCD), we need a normal conditional compressor, which we built using a mixture of static and dynamic finite-context models. Using this approach, we measured chromosomal distances among Hominidae primates and also between Muroidea (rat and mouse), obtaining several insights into evolution that so far have not been reported in the literature.
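The contrast between the conjoint (NCD) and conditional (NCCD) formulations can be illustrated with a minimal sketch. It is not the authors' compressor: the abstract specifies a mixture of static and dynamic finite-context models, whereas this sketch substitutes zlib as a stand-in, approximating the conditional size C(x|y) by compressing x with y supplied as a preset dictionary. The NCCD expression used here, max{C(x|y), C(y|x)} / max{C(x), C(y)}, is assumed from the usual normalized-information-distance form, and the function names c, c_cond, ncd and nccd are hypothetical.

```python
import zlib


def c(x: bytes) -> int:
    """Compressed size of x in bytes; a zlib stand-in for C(x)."""
    return len(zlib.compress(x, 9))


def c_cond(x: bytes, y: bytes) -> int:
    """Rough stand-in for C(x|y): compress x with y as a preset dictionary.

    Assumption: zlib only looks back over a 32 KiB window, so this is
    meaningful only for short sequences, unlike a real conditional compressor.
    """
    comp = zlib.compressobj(level=9, zdict=y)
    return len(comp.compress(x) + comp.flush())


def ncd(x: bytes, y: bytes) -> float:
    """Classical NCD: uses the conjoint size C(xy) of the concatenation."""
    cx, cy = c(x), c(y)
    return (c(x + y) - min(cx, cy)) / max(cx, cy)


def nccd(x: bytes, y: bytes) -> float:
    """Conditional variant (assumed form): uses C(x|y) and C(y|x) directly."""
    return max(c_cond(x, y), c_cond(y, x)) / max(c(x), c(y))
```

With this stand-in, two nearly identical short sequences should score much closer to 0 than two unrelated ones under both measures; real chromosomal data would require a compressor with far longer memory than zlib provides.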
Keywords
biology computing; data compression; evolution (biological); genomics; NCCD; chromosomal distances; conditional compression distance; conjoint information content; finite-context models; genomic evolution; genomic sequences; Hominidae primates; Muroidea; normal conditional compressor; normalized conditional compression distance; Bioinformatics; Informatics; Materials; Telecommunications; compression distances
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Compression Conference (DCC), 2014
Conference_Location
Snowbird, UT
ISSN
1068-0314
Type
conf
DOI
10.1109/DCC.2014.58
Filename
6824473