Title :
Source Coding Scheme for Multiple Sequence Alignments
Author :
Hanus, Pavol ; Dingel, Janis ; Chalkidis, Georg ; Hagenauer, Joachim
Author_Institution :
Inst. for Commun. Eng., Tech. Univ. Munchen, Munich
Abstract :
Rapid development of DNA sequencing technologies exponentially increases the amount of publicly available genomic data. Whole genome multiple sequence alignments represent a particularly voluminous, frequently downloaded static dataset. In this work we propose an asymmetric source coding scheme for such alignments using evolutionary prediction in combination with lossless black and white image compression. Compared to the Lempel-Ziv algorithm used so far the compression rates are almost halved.
Keywords :
DNA; biology computing; image coding; image colour analysis; prediction theory; source coding; DNA sequencing technologies; evolutionary prediction; genome multiple sequence alignments; lossless image compression; source coding scheme; Bioinformatics; Biological cells; DNA; Data compression; Data engineering; Genomics; Image coding; Image databases; Sequences; Source coding;
Conference_Titel :
Data Compression Conference, 2009. DCC '09.
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4244-3753-5
DOI :
10.1109/DCC.2009.64