DocumentCode
1916640
Title
Slashing the Time for BWT Inversion
Author
Kärkkäinen, Juha ; Kempa, Dominik ; Puglisi, Simon J.
Author_Institution
Dept. of Comput. Sci., Univ. of Helsinki, Helsinki, Finland
fYear
2012
fDate
10-12 April 2012
Firstpage
99
Lastpage
108
Abstract
Inverting the Burrows-Wheeler transform (BWT) is a bottleneck in BWT-based decompressors. The state-of-the-art inversion algorithm runs in linear time but is slow in practice due to CPU-cache misses. For more than a decade these cache misses have been thought to be inherent to BWT inversion. We show how to reduce the number of cache misses by a factor of nearly two, and simultaneously the cost of cache misses by another factor of two, obtaining a consistent speed up by a factor of 2.3-4. We can do even better if the data is highly repetitive. We describe an algorithm that achieves an asymptotic reduction in cache misses in theory and is the fastest algorithm in practice for such data.
Keywords
cache storage; data compression; BWT inversion; Burrows-Wheeler transform; cache misses; time slashing; Algorithm design and analysis; Arrays; Complexity theory; Educational institutions; Out of order; Transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2012
Conference_Location
Snowbird, UT
ISSN
1068-0314
Print_ISBN
978-1-4673-0715-4
Type
conf
DOI
10.1109/DCC.2012.18
Filename
6189241
Link To Document