Title :
System issues in parallel sorting for database systems
Author :
Iyer, Balakrishna R. ; Dias, Daniel M.
Author_Institution :
IBM Santa Teresa Lab., San Jose, CA, USA
Abstract :
An external parallel merge-sort and sort-merge join on tightly coupled processors is considered. The issue of whether significant speedup can be achieved with good CPU efficiency is addressed. A pure sort query and a five-relation join query using a sort-merge-join algorithm are examined. It is found that the external sort is readily parallelizable. In the absence of skew, a speedup, linear in the number of tightly coupled processors can be obtained. However, it is shown that skew can reduce the speedup significantly. An examination is made of how important types of skew can be handled to yield close to linear speedup. The effect on the speedup and CPU efficiency of the database size, memory constraints, CPU MIPS, query selectivity, I/O striping and skew is shown
Keywords :
database management systems; information retrieval systems; merging; parallel programming; sorting; CPU MIPS; CPU efficiency; I/O striping; database size; database systems; external parallel merge-sort; five-relation join query; memory constraints; parallel sorting; parallelizable; pure sort query; query selectivity; skew; sort-merge join; speedup; system issues; tightly coupled processors; Cable TV; Database systems; Peak to average power ratio; Petroleum; Radio access networks; Sorting;
Conference_Titel :
Data Engineering, 1990. Proceedings. Sixth International Conference on
Conference_Location :
Los Angeles, CA
Print_ISBN :
0-8186-2025-0
DOI :
10.1109/ICDE.1990.113475