Regional Information Center for Science and Technology - Coding for compression in full-text retrieval systems

DocumentCode :

3254354

Title :

Coding for compression in full-text retrieval systems

Author :

Moffat, Alistair ; Zobel, Justin

Author_Institution :

Dept. of Comput. Sci., Melbourne Univ., Parkville, Vic., Australia

fYear :

1992

fDate :

24-27 March 1992

Firstpage :

Lastpage :

Abstract :

Witten, Bell and Nevill (see ibid., p.23, 1991) have described compression models for use in full-text retrieval systems. The authors discuss other coding methods for use with the same models, and give results that show their scheme yielding virtually identical compression, and decoding more than forty times faster. One of the main features of their implementation is the complete absence of arithmetic coding; this, in part, is the reason for the high speed. The implementation is also particularly suited to slow devices such as CD-ROM, in that the answering of a query requires one disk access for each term in the query and one disk access for each answer. All words and numbers are indexed, and there are no stop words. They have built two compressed databases.

Keywords :

data compression; encoding; full-text databases; Bible; CD-ROM; coding methods; compression models; decoding; disk access; full-text retrieval systems; Arithmetic; CD-ROMs; Computer science; Databases; Decoding; Huffman coding; Indexing; Information retrieval; Law; Legal factors

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Data Compression Conference, 1992. DCC '92.

Conference_Location :

Snowbird, UT, USA

Print_ISBN :

0-8186-2717-4

Type :

conf

DOI :

10.1109/DCC.1992.227474

Filename :

227474

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3254354