DocumentCode
428906
Title
Binary codes for non-uniform sources
Author
Moffat, Alistair ; Anh, Vo Ngoc
Author_Institution
Dept. of Comput. Sci. & Software Eng., Melbourne Univ., Vic., Australia
fYear
2005
fDate
29-31 March 2005
Firstpage
133
Lastpage
142
Abstract
In many applications of compression, decoding speed is at least as important as compression effectiveness. For example, the large inverted indexes associated with text retrieval mechanisms are best stored compressed, but a working system must also process queries at high speed. Here we present two coding methods that make use of fixed binary representations. They have all of the consequent benefits in terms of decoding performance, but are also sensitive to localized variations in the source data, and in practice give excellent compression. The methods are validated by applying them to various test data, including the index of an 18 GB document collection.
Keywords
binary codes; decoding; indexing; query processing; source coding; compression; decoding performance; decoding speed; document collection; fixed binary representations; nonuniform sources; Application software; Computer science; Costs; Dictionaries; Entropy; Probability distribution; Software engineering; Testing
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Compression Conference, 2005. Proceedings. DCC 2005
ISSN
1068-0314
Print_ISBN
0-7695-2309-9
Type
conf
DOI
10.1109/DCC.2005.22
Filename
1402174