DocumentCode :
3254354
Title :
Coding for compression in full-text retrieval systems
Author :
Moffat, Alistair ; Zobel, Justin
Author_Institution :
Dept. of Comput. Sci., Melbourne Univ., Parkville, Vic., Australia
fYear :
1992
fDate :
24-27 March 1992
Firstpage :
72
Lastpage :
81
Abstract :
Witten, Bell and Nevill (see ibid., p.23, 1991) have described compression models for use in full-text retrieval systems. The authors discuss other coding methods for use with the same models, and give results that show their scheme yielding virtually identical compression, and decoding more than forty times faster. One of the main features of their implementation is the complete absence of arithmetic coding; this, in part, is the reason for the high speed. The implementation is also particularly suited to slow devices such as CD-ROM, in that the answering of a query requires one disk access for each term in the query and one disk access for each answer. All words and numbers are indexed, and there are no stop words. They have built two compressed databases.<>
Keywords :
data compression; encoding; full-text databases; Bible; CD-ROM; coding methods; compression models; decoding; disk access; full-text retrieval systems; Arithmetic; CD-ROMs; Computer science; Databases; Decoding; Huffman coding; Indexing; Information retrieval; Law; Legal factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Compression Conference, 1992. DCC '92.
Conference_Location :
Snowbird, UT, USA
Print_ISBN :
0-8186-2717-4
Type :
conf
DOI :
10.1109/DCC.1992.227474
Filename :
227474
Link To Document :
بازگشت