• DocumentCode
    3796020
  • Title

    Inverted file partitioning schemes in multiple disk systems

  • Author

    Byeong-Soo Jeong;E. Omiecinski

  • Author_Institution
    Coll. of Comput., Georgia Inst. of Technol., Atlanta, GA, USA
  • Volume
    6
  • Issue
    2
  • fYear
    1995
  • Firstpage
    142
  • Lastpage
    153
  • Abstract
    Multiple-disk I/O systems (disk arrays) have been an attractive approach to meet high performance I/O demands in data intensive applications such as information retrieval systems. When we partition and distribute files across multiple disks to exploit the potential for I/O parallelism, a balanced I/O workload distribution becomes important for good performance. Naturally, the performance of a parallel information retrieval system using an inverted file structure is affected by the partitioning scheme of the inverted file. In this paper, we propose two different partitioning schemes for an inverted file system for a shared-everything multiprocessor machine with multiple disks. We study the performance of these schemes by simulation under a number of workloads where the term frequencies in the documents are varied, the term frequencies in the queries are varied, the number of disks are varied and the multiprogramming level is varied.
  • Keywords
    "Information retrieval","File systems","Frequency","Parallel architectures","Spatial databases","Message passing","Scalability","Load management","Multiprocessing systems","System performance"
  • Journal_Title
    IEEE Transactions on Parallel and Distributed Systems
  • Publisher
    ieee
  • ISSN
    1045-9219
  • Type

    jour

  • DOI
    10.1109/71.342125
  • Filename
    342125