Surrogate join for massive data on tertiary storage system

Author

Liu, Baoliang ; Li, Jianzhong ; Zhang, Yanqiu

Author_Institution

Harbin Inst. of Technol., Heilongjiang, China

fYear

2004

fDate

7-9 July 2004

Firstpage

271

Lastpage

276

Abstract

In This work surrogate join (SJ) for massive data on tertiary storage is presented. The relations to be joined are first split into surrogate relations and nonsurrogate relations. Surrogate relation consists of tuple identifier and join attribute and nonsurrogate relation consists of tuple identifier and nonjoin attributes. Join is first performed on the two surrogate relations and a join result index is produced which consists of the identifiers of the matching tuples of both surrogate relations, then the join result index is merged with both nonsurrogate relations to get final join result. Experimental results show that our method is better than previous ones in performance and scalability. Note that SJ can convert tertiary join into disk join and one pass scan of both tertiary resident nonsurrogate relations for most applications.

Keywords

database indexing; disk join; join attribute; join result index; nonjoin attribute; nonsurrogate relations; surrogate join; surrogate relations; tertiary join; tuple identifier; Costs; Database systems; Earth; Magnetic devices; Magnetic switching; Memory; Mobile handsets; Random media; Scalability; Switches;

fLanguage

English

Publisher

ieee

Conference_Titel

Database Engineering and Applications Symposium, 2004. IDEAS '04. Proceedings. International

ISSN

1098-8068

Print_ISBN

0-7695-2168-1

Type

conf

DOI

10.1109/IDEAS.2004.1319800

Filename

1319800