DocumentCode
3264345
Title
Surrogate join for massive data on tertiary storage system
Author
Liu, Baoliang ; Li, Jianzhong ; Zhang, Yanqiu
Author_Institution
Harbin Inst. of Technol., Heilongjiang, China
fYear
2004
fDate
7-9 July 2004
Firstpage
271
Lastpage
276
Abstract
In This work surrogate join (SJ) for massive data on tertiary storage is presented. The relations to be joined are first split into surrogate relations and nonsurrogate relations. Surrogate relation consists of tuple identifier and join attribute and nonsurrogate relation consists of tuple identifier and nonjoin attributes. Join is first performed on the two surrogate relations and a join result index is produced which consists of the identifiers of the matching tuples of both surrogate relations, then the join result index is merged with both nonsurrogate relations to get final join result. Experimental results show that our method is better than previous ones in performance and scalability. Note that SJ can convert tertiary join into disk join and one pass scan of both tertiary resident nonsurrogate relations for most applications.
Keywords
database indexing; disk join; join attribute; join result index; nonjoin attribute; nonsurrogate relations; surrogate join; surrogate relations; tertiary join; tuple identifier; Costs; Database systems; Earth; Magnetic devices; Magnetic switching; Memory; Mobile handsets; Random media; Scalability; Switches;
fLanguage
English
Publisher
ieee
Conference_Titel
Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International
ISSN
1098-8068
Print_ISBN
0-7695-2168-1
Type
conf
DOI
10.1109/IDEAS.2004.1319800
Filename
1319800
Link To Document