DocumentCode
347823
Title
Combining semijoins and hash-semijoins in a distributed query processing strategy
Author
Morrissey, J.M. ; Ogunbadejo, O.
Author_Institution
Sch. of Comput. Sci., Windsor Univ., Ont., Canada
Volume
1
fYear
1999
fDate
9-12 May 1999
Firstpage
122
Abstract
Tseng and Chen (1992) presented a new relational operator called a hash-semijoin. They proposed a method which will transform a sequence of semijoins, produced by any existing semijoin-based algorithm, into a more cost effective sequence by replacing some of the semijoins by hash-semijoins. The hash-semijoin can be very effective but some of the benefits may be negated by the occurrence of collisions in the reduction filter. Such collisions mean that the hash-semijoin cannot achieve the same amount of reduction as a semijoin can. However, under certain circumstances, the benefits may greatly outweigh the costs and thus collisions are not a serious problem. But their approach has not been experimentally evaluated. In this paper we explore their hypothesis that traditional algorithms, which use semijoins, can always be improved by replacing some or all of the semijoins by hash-semijoins. Four different algorithms are investigated. Each aims to minimize the amount of data transmitted around the network during distributed query processing.
Keywords
distributed databases; query processing; relational databases; algorithms; distributed query processing strategy; hash-semijoin; reduction filter collisions; relational operator; semijoin; Character generation; Computer network management; Computer science; Costs; Database systems; Filters; Intelligent networks; Query processing; Scheduling algorithm; Software engineering;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 1999 IEEE Canadian Conference on
Conference_Location
Edmonton, Alberta, Canada
ISSN
0840-7789
Print_ISBN
0-7803-5579-2
Type
conf
DOI
10.1109/CCECE.1999.807182
Filename
807182
Link To Document