DocumentCode
3796020
Title
Inverted file partitioning schemes in multiple disk systems
Author
Byeong-Soo Jeong;E. Omiecinski
Author_Institution
Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
Volume
6
Issue
2
fYear
1995
Firstpage
142
Lastpage
153
Abstract
Multiple-disk I/O systems (disk arrays) have been an attractive approach to meeting high-performance I/O demands in data-intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads in which the term frequencies in the documents, the term frequencies in the queries, the number of disks, and the multiprogramming level are varied.
Keywords
"Information retrieval","File systems","Frequency","Parallel architectures","Spatial databases","Message passing","Scalability","Load management","Multiprocessing systems","System performance"
Journal_Title
IEEE Transactions on Parallel and Distributed Systems
Publisher
IEEE
ISSN
1045-9219
Type
jour
DOI
10.1109/71.342125
Filename
342125