DocumentCode :
1312042
Title :
Effectiveness of parallel joins
Author :
Lakshmi, M. Seetha ; Yu, Philip S.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
2
Issue :
4
fYear :
1990
fDate :
12/1/1990 12:00:00 AM
Firstpage :
410
Lastpage :
424
Abstract :
The effectiveness of parallel processing of relational join operations is examined. The skew in the distribution of join attribute values and the stochastic nature of the task processing times are identified as the major factors that can affect the effective exploitation of parallelism. Expressions for the execution time of parallel hash join and semijoin are derived and their effectiveness analyzed. When many small processors are used in the parallel architecture, the skew can result in some processors becoming sources of bottleneck while other processors are being underutilized. Even in the absence of skew, the variations in the processing times of the parallel tasks belonging to a query can lead to high task synchronization delay and impact the maximum speedup achievable through parallel execution. For example, when the task processing time on each processor is exponential with the same mean, the speedup is proportional to P/ln(P) where P is the number of processors. Other factors such as memory size, communication bandwidth, etc., can lead to even lower speedup. These are quantified using analytical models
Keywords :
database theory; parallel programming; relational databases; storage management; distribution; execution time; high task synchronization delay; join attribute values; maximum speedup; parallel architecture; parallel execution; parallel hash join; parallel processing; relational join operations; semijoin; skew; small processors; stochastic nature; task processing times; Analytical models; Bandwidth; Database machines; Delay; Parallel architectures; Parallel processing; Performance analysis; Proposals; Relational databases; Stochastic processes;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/69.63253
Filename :
63253
Link To Document :
بازگشت