• DocumentCode
    1573087
  • Title

    Improvement of snapshot differential algorithm based on hadoop platform

  • Author

    Yuan, Guoyong ; Li, Bin ; Xiao, Taiyang

  • Author_Institution
    Dept. of Comput. Sci., Jinan Univ., Guangzhou, China
  • Volume
    2
  • fYear
    2011
  • Firstpage
    1212
  • Lastpage
    1214
  • Abstract
    Snapshot differential algorithm is one of ways of extracting delta from views in the data warehouse in data integration circumstance. Due to the scale of the views in data warehouse is likely to be very massive, it will take lots of time to run snapshot differential algorithm and become the bottleneck of the system performance. In this paper, in order to improve efficiency of Snapshot Differential Algorithm, by using the massive data processing platform, we modify traditional Partition Hash algorithm, improve the efficiency and reduce the calculating time. At the end of this paper, we show a test which will demonstrate the improvement of efficiency after modification.
  • Keywords
    data warehouses; data integration; data warehouse; hadoop platform; massive data processing platform; partition hash algorithm; snapshot differential algorithm; delta extraction; distributed computing; snapshot differential algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cross Strait Quad-Regional Radio Science and Wireless Technology Conference (CSQRWC), 2011
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4244-9792-8
  • Type

    conf

  • DOI
    10.1109/CSQRWC.2011.6037179
  • Filename
    6037179