Title :
Relative Lempel-Ziv with Constant-Time Random Access
Author :
Gagie, Travis ; Puglisi, Simon J.
Author_Institution :
Univ. of Helsinki, Helsinki, Finland
Abstract :
Relative Lempel-Ziv [1] (RLZ) is a variant of LZ77 that can compress collections of similar genomes well, while still allowing fast random access to them. We implemented RLZ using compressed bit vectors to support constant-time random access at the cost of sublinear extra space. We compared our implementation of RLZ to Deorowicz and Grabowski´s GDC [11] scheme and achieved comparable compression and much smaller access times for short substrings.
Keywords :
bioinformatics; data compression; genomics; LZ77; RLZ; compressed bit vectors; constant time random access; genomes collection compression; relative Lempel-Ziv; sublinear extra space; Bioinformatics; Data compression; Educational institutions; Genomics; Information retrieval; Libraries; Robustness; random access; relative Lempel-Ziv;
Conference_Titel :
Data Compression Conference (DCC), 2014
Conference_Location :
Snowbird, UT
DOI :
10.1109/DCC.2014.42