DocumentCode :
661138
Title :
A robust join operator to process streaming data in real-time data warehousing
Author :
Naeem, Muhammad A.
Author_Institution :
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland, New Zealand
fYear :
2013
fDate :
10-12 Sept. 2013
Firstpage :
119
Lastpage :
124
Abstract :
In real-time data warehousing, semi-stream processing has become an active area of research over the last decade. One important operation in semi-stream processing is joining stream data with slowly changing disk-based master data, which requires a join operator. This operator typically works with limited main memory, which is generally not large enough to hold all of the disk-based master data. Recently, a seminal join algorithm called MESHJOIN (Mesh Join) was proposed in the literature to process semi-stream data. MESHJOIN is a candidate for a resource-aware system setup. However, MESHJOIN is not very selective. In particular, it does not consider the characteristics of the stream data, and its performance is suboptimal for skewed stream data. In this paper we propose a novel Semi-Stream Join (SSJ) that uses a new cache module. The algorithm is better suited to skewed distributions, and we present results for Zipfian distributions of the type that appear in many applications. We conduct a rigorous experimental study to test our algorithm. Our experiments show that SSJ significantly outperforms MESHJOIN. We also present a cost model for SSJ and validate it experimentally.
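The general idea of putting a cache in front of disk-based master data can be illustrated with a minimal sketch. This is not the SSJ algorithm from the paper, whose cache module is not described in this record, and it is not MESHJOIN. It assumes a plain LRU cache as a stand-in, and the names `CachedSemiStreamJoin`, `master_lookup`, and `disk_reads` are hypothetical. Under a skewed stream, a few frequent keys are served from memory, so fewer lookups reach the disk.

```python
from collections import OrderedDict


class CachedSemiStreamJoin:
    """Illustrative only: join stream tuples with master data via a small LRU cache."""

    def __init__(self, master_lookup, cache_size):
        # master_lookup is a hypothetical callable standing in for a disk read:
        # it maps a join key to a master record, or returns None if absent.
        self.master_lookup = master_lookup
        self.cache_size = cache_size
        self.cache = OrderedDict()  # least recently used key sits first
        self.disk_reads = 0         # counts simulated disk accesses

    def _fetch(self, key):
        if key in self.cache:
            # Cache hit: mark the key as most recently used.
            self.cache.move_to_end(key)
            return self.cache[key]
        # Cache miss: fall back to the (simulated) disk-based master data.
        self.disk_reads += 1
        record = self.master_lookup(key)
        if record is not None:
            self.cache[key] = record
            if len(self.cache) > self.cache_size:
                self.cache.popitem(last=False)  # evict least recently used
        return record

    def join(self, stream):
        # Emit one joined tuple per stream tuple that has a matching master record.
        for key, payload in stream:
            record = self._fetch(key)
            if record is not None:
                yield key, payload, record


# Toy master data and a small skewed stream (keys 1 and 2 dominate).
master = {k: f"product-{k}" for k in range(1000)}
skewed_stream = [(1, "a"), (1, "b"), (2, "c"), (1, "d"), (2, "e"), (999, "f")]

joiner = CachedSemiStreamJoin(master.get, cache_size=2)
results = list(joiner.join(skewed_stream))
```

In this toy run only the first occurrence of each distinct key triggers a simulated disk read; repeated keys are answered from the cache, which is the effect a skew-aware join aims to exploit.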
Keywords :
data handling; data warehouses; MESHJOIN; SSJ; Zipfian distributions; real-time data warehousing; robust join operator; semistream join; streaming data processing; Algorithm design and analysis; Loading; Mathematical model; Partitioning algorithms; Probes; Real-time systems; Warehousing; Join operator; Performance measurement; Real-time data warehousing; Semi-stream processing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Digital Information Management (ICDIM), 2013 Eighth International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4799-0613-0
Type :
conf
DOI :
10.1109/ICDIM.2013.6693964
Filename :
6693964