DocumentCode :
2258098
Title :
Parallel indexing in a Chinese information retrieval system
Author :
Wong, Kam-Fai ; Lum, Vincent Y.
Author_Institution :
Dept. of Syst. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
fYear :
1994
fDate :
9-11 Nov 1994
Firstpage :
320
Lastpage :
325
Abstract :
The increasing data size in Chinese information-based applications renders conventional information retrieval (IR) systems unsuitable. This is because they are limited in both storage and speed. To overcome these predicaments, a parallel Chinese IR system (CIR) has been designed. It is being developed on a SIMD parallel computer, DECmpp, which is configured with 8,192 processing elements. It uses full inverted indices for retrieval. The “divide-and-conquer” principle is exercised in exploiting data parallelism in the inverted index files. The inverted indices are first partitioned into fragments. Each fragment is then assigned to an individual processing elements. Thereafter, during an index retrieval operation, all index fragments are searched in parallel. Although the principle is simple, realising the parallel indexing algorithm in a naive fashion (i.e. without considering the underlying parallel architecture) would result in poor retrieval performance. During the design of the CIR system, 3 different implementation models for parallel indexing have been considered. In this paper, qualitative evaluation of the 3 models is presented. Based on the result of the evaluation, the model that offers the best run-time performance was adopted
Keywords :
DEC computers; indexing; information retrieval systems; parallel processing; Chinese information retrieval system; DECmpp massively parallel processor; SIMD parallel computer; data parallelism; divide-and-conquer principle; full inverted indices; index fragments; parallel indexing; partitioned indexes; retrieval performance; run-time performance; Application software; Artificial intelligence; Concurrent computing; Engines; Indexing; Information retrieval; Information systems; Parallel architectures; Parallel processing; Systems engineering and theory;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Software and Applications Conference, 1994. COMPSAC 94. Proceedings., Eighteenth Annual International
Conference_Location :
Taipei
Print_ISBN :
0-8186-6705-2
Type :
conf
DOI :
10.1109/CMPSAC.1994.342784
Filename :
342784
Link To Document :
بازگشت