Title :
Genome Sequence Compression with Distributed Source Coding
Author :
Shuang Wang ; Xiaoqian Jiang ; Lijuan Cui ; Wenrui Dai ; Deligiannis, Nikos ; Pinghao Li ; Hongkai Xiong ; Cheng, Shukang ; Ohno-Machado, L.
Author_Institution :
Div. of Biomed. Inf., Univ. of California, San Diego, La Jolla, CA, USA
Abstract :
In this paper, we develop a novel genome compression framework based on distributed source coding (DSC)[3], which is specially tailored to the need of miniaturized devices. At the encoder side, subsequences with adaptive code length can be compressed flexibly through either low complexity DSC based syndrome coding or hash coding with the decision determined by the existence of variations between source and reference known from the decoder feedback. Moreover, to tackle the variations between source and reference at the decoder, we carefully designed a factor graph based low-density parity-check (LDPC) decoder, which automatically detects insertion, deletion and substitution.
Keywords :
adaptive codes; data compression; decoding; graph theory; parity check codes; sequences; source coding; LDPC decoder; adaptive code length; decoder feedback; distributed source coding; factor graph based low-density parity-check decoder; genome compression framework; genome sequence compression; hash coding; low complexity DSC based syndrome coding; Bioinformatics; Complexity theory; Decoding; Genomics; Sequential analysis; Source coding;
Conference_Titel :
Data Compression Conference (DCC), 2013
Conference_Location :
Snowbird, UT
Print_ISBN :
978-1-4673-6037-1
DOI :
10.1109/DCC.2013.104