• DocumentCode
    970343
  • Title

    An O(N) semipredictive universal encoder via the BWT

  • Author

    Baron, Dror ; Bresler, Yoram

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Univ. of Illinois, Urbana, IL, USA
  • Volume
    50
  • Issue
    5
  • fYear
    2004
  • fDate
    5/1/2004 12:00:00 AM
  • Firstpage
    928
  • Lastpage
    937
  • Abstract
    We provide an O(N) algorithm for a nonsequential semipredictive encoder whose pointwise redundancy with respect to any (unbounded depth) tree source is O(1) bits per state above Rissanen´s lower bound. This is achieved by using the Burrows-Wheeler transform (BWT), an invertible permutation transform that has been suggested for lossless data compression. First, we use the BWT only as an efficient computational tool for pruning context trees, and encode the input sequence rather than the BWT output. Second, we estimate the minimum description length (MDL) source by incorporating suffix tree methods to construct the unbounded depth context tree that corresponds to the input sequence in O(N) time. Third, we point out that a variety of previous source coding methods required superlinear complexity for determining which tree source state generated each of the symbols of the input. We show how backtracking from the BWT output to the input sequence enables to solve this problem in O(N) worst case complexity.
  • Keywords
    data compression; dynamic programming; source coding; transform coding; transforms; Burrows-Wheeler transform; context tree pruning; data compression; dynamic programming; lossless source coding; minimum description length source; semipredictive universal encoder; suffix tree; Computational complexity; Data compression; Delay; Encoding; Entropy; Maximum likelihood decoding; Maximum likelihood estimation; Radio access networks; Source coding; Tree data structures;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2004.826664
  • Filename
    1291744