DocumentCode
899083
Title
Speeding up external mergesort
Author
Zheng, LuoQuan ; Larson, Per-Åke
Author_Institution
IBM Canada Ltd., North York, Ont., Canada
Volume
8
Issue
2
fYear
1996
fDate
4/1/1996 12:00:00 AM
Firstpage
322
Lastpage
332
Abstract
External mergesort is normally implemented so that each run is stored continuously on disk and blocks of data are read exactly in the order they are needed during merging. We investigate two ideas for improving the performance of external mergesort: interleaved layout and a new reading strategy. Interleaved layout places blocks from different runs in consecutive disk addresses. This is done in the hope that interleaving will reduce seek overhead during merging. The new reading strategy precomputes the order in which data blocks are to be read according to where they are located on disk and when they are needed for merging. Extra buffer space makes it possible to read blocks in an order that reduces seek overhead, instead of reading them exactly in the order they are needed for merging. A detailed simulation model was used to compare the two layout strategies and three reading strategies. The effects of using multiple work disks were also investigated. We found that, in most cases, interleaved layout does not improve performance, but that the new reading strategy consistently performs better than double buffering and forecasting
Keywords
buffer storage; interleaved storage; magnetic disc storage; merging; sorting; buffer space; consecutive disk addresses; data blocks; external mergesort; interleaved layout; layout strategies; multiple work disks; reading strategies; reading strategy; seek overhead; seek overhead reduction; simulation model; Algorithm design and analysis; Computer science; Delay; File systems; Heuristic algorithms; Laboratories; Magnetic separation; Merging; Scheduling algorithm; Sorting;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/69.494169
Filename
494169
Link To Document