• DocumentCode
    428906
  • Title

    Binary codes for non-uniform sources

  • Author

    Moffat, Alistair ; Anh, Vo Ngoc

  • Author_Institution
    Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic., Australia
  • fYear
    2005
  • fDate
    29-31 March 2005
  • Firstpage
    133
  • Lastpage
    142
  • Abstract
    In many applications of compression, decoding speed is at least as important as compression effectiveness. For example, the large inverted indexes associated with text retrieval mechanisms are best stored compressed, but a working system must also process queries at high speed. Here we present two coding methods that make use of fixed binary representations. They have all of the consequent benefits in terms of decoding performance, but are also sensitive to localized variations in the source data, and in practice give excellent compression. The methods are validated by applying them to various test data, including the index of an 18 GB document collection.
  • Keywords
    binary codes; decoding; indexing; query processing; source coding; binary codes; compression; decoding performance; decoding speed; document collection; fixed binary representations; indexing; nonuniform sources; query processing; Application software; Binary codes; Computer science; Costs; Decoding; Dictionaries; Entropy; Probability distribution; Software engineering; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Compression Conference, 2005. Proceedings. DCC 2005
  • ISSN
    1068-0314
  • Print_ISBN
    0-7695-2309-9
  • Type

    conf

  • DOI
    10.1109/DCC.2005.22
  • Filename
    1402174