Title :
Linear time universal coding of tree sources via FSM closure
Author :
Martín, Alvaro ; Seroussi, Gadiel ; Weinberger, Marcelo J.
Author_Institution :
Hewlett-Packard Labs., Palo Alto, CA, USA
Abstract :
Applying generalized context trees and their finite state machine closures, a two-pass version of context, a twice-universal lossless coding scheme for tree models, can be implemented in linear encoding/decoding time. As it turns out, an optimal context selection rule and the corresponding context transitions are computationally not more expensive than the various steps involved in the implementation of the Burrows-Wheeler transform (BWT) and use, in fact, similar tools. Also the paper presents a reversible transform that displays the same "context deinterleaving" feature as the BWT but is naturally based on an optimal context tree. This transform offers insight into the workings of the BWT and the nature of its suboptimality for twice-universal coding of tree models.
Keywords :
data compression; finite state machines; linear codes; source coding; transforms; tree codes; BWT; Burrows-Wheeler transform; context deinterleaving feature; finite state machine closure; generalized context tree; linear time coding; lossless coding scheme; twice-universal coding; Automata; Computational complexity; Context modeling; Data compression; Decoding; Displays; Encoding; Entropy; Laboratories; Upper bound;
Conference_Titel :
Data Compression Conference, 2004. Proceedings. DCC 2004
Print_ISBN :
0-7695-2082-0
DOI :
10.1109/DCC.2004.1281482