DocumentCode
2865846
Title
In-place length-restricted prefix coding
Author
Milidiu, Ruy Luiz ; Pessoa, Artur Alves ; Laber, Eduardo Sany
Author_Institution
Dept. de Inf., PUC, Rio de Janeiro, Brazil
fYear
1998
fDate
9-11 Sep 1998
Firstpage
50
Lastpage
59
Abstract
Huffman codes, combined with word-based models, are considered efficient compression schemes for full-text retrieval systems. The decoding rate for these schemes can be substantially improved if the maximum length of the codewords is not greater then the machine word size L. However, if the vocabulary is large, simple methods for generating optimal length-restricted codes are either too slow or require a significantly large amount of memory. We present an in-place, simple and fast implementation for the BRCI (Build, Remove, Condense and Insert) algorithm, an approximative method for length-restricted coding. It overwrites a sorted input list of n weights with the corresponding codeword lengths in O(n) time. In addition, the worst-case compression loss introduced by BRCI codes with respect to unrestricted Huffman codes is proved to be negligible for all practical values of both L and n
Keywords
Huffman codes; computational complexity; data compression; decoding; full-text databases; sorting; vocabulary; BRCI; Build Remove Condense Insert algorithm; Huffman codes; codewords; compression schemes; decoding; full-text retrieval systems; in-place length-restricted prefix coding; sorted input list; vocabulary; word-based models; Databases; Decoding; Delay; Encoding; Information retrieval; Polynomials; Upper bound; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
String Processing and Information Retrieval: A South American Symposium, 1998. Proceedings
Conference_Location
Santa Cruz de La Sierra
Print_ISBN
0-8186-8664-2
Type
conf
DOI
10.1109/SPIRE.1998.712982
Filename
712982
Link To Document