DocumentCode :
2625887
Title :
Clustering-Variable-Width Histogram Based Window Semi-hash Multi-join over Streams
Author :
Zhang, Xiaojian ; Jiang, Wanchang ; Zhang, Yadong ; Huo, Cong
Author_Institution :
Henan Univ. of Finance & Econ., Zhengzhou
fYear :
2007
fDate :
21-23 Nov. 2007
Firstpage :
850
Lastpage :
853
Abstract :
The join operator has become increasingly important in the context of data streams. Most stream join algorithms to date are based on nested loop joins or hash joins. However, these time-expensive algorithms are not suitable for CPU-limited settings. In this paper, a clustering-based variable-width histogram is designed to capture the value distribution of tuples in the sliding windows while retaining some important characteristics of the tuples. Semi-hash tables are then constructed from the histogram. Our sliding window semi-hash multi-join algorithm can minimize the processing time cost of the join and produce an accurate join result much earlier. Experimental results show that our approach is more efficient than other approaches.
Keywords :
data analysis; file organisation; query processing; clustering-variable-width histogram; data stream; nested loop joins; processing time cost; time expensive algorithms; window semi-hash multi-join; Clustering algorithms; Computer science; Costs; Data analysis; Data engineering; Educational institutions; Finance; Histograms; Information science; Information technology;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Convergence Information Technology, 2007. International Conference on
Conference_Location :
Gyeongju
Print_ISBN :
0-7695-3038-9
Type :
conf
DOI :
10.1109/ICCIT.2007.246
Filename :
4420366