DocumentCode
1573087
Title
Improvement of snapshot differential algorithm based on hadoop platform
Author
Yuan, Guoyong ; Li, Bin ; Xiao, Taiyang
Author_Institution
Dept. of Comput. Sci., Jinan Univ., Guangzhou, China
Volume
2
fYear
2011
Firstpage
1212
Lastpage
1214
Abstract
The snapshot differential algorithm is one way of extracting deltas from views in a data warehouse in a data integration setting. Because the views in a data warehouse are likely to be very large, running the snapshot differential algorithm can take a long time and become the bottleneck of system performance. In this paper, to improve the efficiency of the snapshot differential algorithm, we modify the traditional Partition Hash algorithm to run on a massive data processing platform, which reduces the computation time. At the end of the paper, we present a test that demonstrates the efficiency improvement after the modification.
Keywords
data warehouses; data integration; data warehouse; hadoop platform; massive data processing platform; partition hash algorithm; snapshot differential algorithm; delta extraction; distributed computing
fLanguage
English
Publisher
IEEE
Conference_Titel
Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
Conference_Location
Harbin
Print_ISBN
978-1-4244-9792-8
Type
conf
DOI
10.1109/CSQRWC.2011.6037179
Filename
6037179