• DocumentCode
    29531
  • Title

    Universal Enumerative Coding for Tree Models

  • Author

    Martin, Andrew ; Seroussi, Gadiel ; Weinberger, Marcelo J.

  • Author_Institution
    Univ. de la Republica, Montevideo, Uruguay
  • Volume
    60
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    1387
  • Lastpage
    1411
  • Abstract
    Efficient enumerative coding for tree sources is, in general, surprisingly intricate-a simple uniform encoding of type classes, which is asymptotically optimal in expectation for many classical models, such as FSMs, turns out not to be so in this case. We describe an efficiently computable enumerative code that is universal in the family of tree models in the sense that, for a string emitted by an unknown source whose model is supported on a known tree, the expected normalized code length of the encoding approaches the entropy rate of the source with a convergence rate (K/2)(log n)/n, where K is the number of free parameters of the model family. Based on recent results characterizing type classes of context trees, the code consists of the index of the sequence in the tree type class, and an efficient description of the class itself using a nonuniform encoding of selected string counts. The results are extended to a twice-universal setting, where the tree underlying the source model is unknown.
  • Keywords
    tree codes; trees (mathematics); computable enumerative code; entropy rate; nonuniform encoding; tree model; tree sources; universal enumerative coding; Computational modeling; Context; Encoding; Entropy; Indexes; Polynomials; Probability distribution; Universal coding; enumerative coding; method of types; tree models; type classes;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2013.2295217
  • Filename
    6685867