DocumentCode :
2643528
Title :
A multi-pass algorithm for sorting extremely large data files
Author :
Shatnawi, Ali ; Alzahouri, Yathrip
Author_Institution :
Dept. of Comput. Eng., Jordan Univ. of Sci. & Technol., Irbid, Jordan
fYear :
2015
fDate :
7-9 April 2015
Firstpage :
79
Lastpage :
82
Abstract :
An extremely large data file is a file that is greater than the size of the main memory by multiple orders of magnitude. Sorting such a file involves external sorting algorithm, which uses both the hard disk and the main memory to accomplish the sorting task. Since the hard disk is much slower than the main memory, the number of hard disk input/output operations is considered the main performance metric. The new proposed method decreases the total number of input/output operations; hence, it reduces the total time of sorting. The proposed method has less number of disk read/write operations than currently existing approaches. The input/output complexity of the proposed algorithm is analyzed and compared with other algorithms. The proposed algorithm uses a constant merging order at the merge phase of the external sort with multiple passes over each set of data. It is shown that the proposed algorithm has lower sort time requirements than previous approaches.
Keywords :
computational complexity; file organisation; merging; sorting; constant merging order; data file sorting; external sorting algorithm; extremely large data file; hard disk; input-output complexity; main memory; multipass algorithm; sort time requirements; Algorithm design and analysis; Communication systems; Complexity theory; Computers; Hard disks; Merging; Sorting; External Merge Sort; Large File Processing; Sorting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Systems (ICICS), 2015 6th International Conference on
Conference_Location :
Amman
Type :
conf
DOI :
10.1109/IACS.2015.7103206
Filename :
7103206
Link To Document :
بازگشت