DocumentCode :
3796020
Title :
Inverted file partitioning schemes in multiple disk systems
Author :
Byeong-Soo Jeong;E. Omiecinski
Author_Institution :
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
6
Issue :
2
Year :
1995
Firstpage :
142
Lastpage :
153
Abstract :
Multiple-disk I/O systems (disk arrays) have been an attractive approach to meeting high-performance I/O demands in data-intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system on a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads in which the term frequencies in the documents, the term frequencies in the queries, the number of disks, and the multiprogramming level are varied.
Keywords :
"Information retrieval","File systems","Frequency","Parallel architectures","Spatial databases","Message passing","Scalability","Load management","Multiprocessing systems","System performance"
Journal_Title :
IEEE Transactions on Parallel and Distributed Systems
Publisher :
IEEE
ISSN :
1045-9219
Type :
jour
DOI :
10.1109/71.342125
Filename :
342125