• DocumentCode
    899083
  • Title

    Speeding up external mergesort

  • Author

    Zheng, LuoQuan ; Larson, Per-Åke

  • Author_Institution
    IBM Canada Ltd., North York, Ont., Canada
  • Volume
    8
  • Issue
    2
  • fYear
    1996
  • fDate
    4/1/1996 12:00:00 AM
  • Firstpage
    322
  • Lastpage
    332
  • Abstract
    External mergesort is normally implemented so that each run is stored continuously on disk and blocks of data are read exactly in the order they are needed during merging. We investigate two ideas for improving the performance of external mergesort: interleaved layout and a new reading strategy. Interleaved layout places blocks from different runs in consecutive disk addresses. This is done in the hope that interleaving will reduce seek overhead during merging. The new reading strategy precomputes the order in which data blocks are to be read according to where they are located on disk and when they are needed for merging. Extra buffer space makes it possible to read blocks in an order that reduces seek overhead, instead of reading them exactly in the order they are needed for merging. A detailed simulation model was used to compare the two layout strategies and three reading strategies. The effects of using multiple work disks were also investigated. We found that, in most cases, interleaved layout does not improve performance, but that the new reading strategy consistently performs better than double buffering and forecasting
  • Keywords
    buffer storage; interleaved storage; magnetic disc storage; merging; sorting; buffer space; consecutive disk addresses; data blocks; external mergesort; interleaved layout; layout strategies; multiple work disks; reading strategies; reading strategy; seek overhead; seek overhead reduction; simulation model; Algorithm design and analysis; Computer science; Delay; File systems; Heuristic algorithms; Laboratories; Magnetic separation; Merging; Scheduling algorithm; Sorting;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/69.494169
  • Filename
    494169