• DocumentCode
    1916640
  • Title

    Slashing the Time for BWT Inversion

  • Author

    Kärkkäinen, Juha ; Kempa, Dominik ; Puglisi, Simon J.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Helsinki, Helsinki, Finland
  • fYear
    2012
  • fDate
    10-12 April 2012
  • Firstpage
    99
  • Lastpage
    108
  • Abstract
    Inverting the Burrows-Wheeler transform (BWT) is a bottleneck in BWT-based decompressors. The state-of-the-art inversion algorithm runs in linear time but is slow in practice due to CPU-cache misses. For more than a decade these cache misses have been thought to be inherent to BWT inversion. We show how to reduce the number of cache misses by a factor of nearly two, and simultaneously the cost of cache misses by another factor of two, obtaining a consistent speed up by a factor of 2.3-4. We can do even better if the data is highly repetitive. We describe an algorithm that achieves an asymptotic reduction in cache misses in theory and is the fastest algorithm in practice for such data.
  • Keywords
    cache storage; data compression; BWT inversion; Burrows-Wheeler transform; cache misses; time slashing; Algorithm design and analysis; Arrays; Complexity theory; Educational institutions; Out of order; Transforms;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference (DCC), 2012
  • Conference_Location
    Snowbird, UT
  • ISSN
    1068-0314
  • Print_ISBN
    978-1-4673-0715-4
  • Type

    conf

  • DOI
    10.1109/DCC.2012.18
  • Filename
    6189241