• DocumentCode
    683947
  • Title

    An optimized ETL fault-tolerant algorithm in data warehouses

  • Author

    Tu, Shitao ; Zhu, Lanjuan

  • Author_Institution
    Department of Automation, Shanghai Jiao Tong University, and Key Laboratory of System Control and Information Processing, Ministry of Education of China, 200240, China
  • fYear
    2013
  • fDate
    23-25 March 2013
  • Firstpage
    484
  • Lastpage
    487
  • Abstract
    Extraction-Transformation-Loading (ETL) plays an important role in data warehouse. Typically, performance is considered the main factor in ETL projects. Actually, faulttolerance and many other aspects influence the results of ETL greatly especially when the time period of projects are long and transformation rules cannot be determined from beginning, such as the situation of changing business logic. To satisfy the fault-tolerance and data validation in such kinds of situation, in this paper, we introduce a fault-tolerant algorithm which gives Redo strategy for different process interrupt scenarios. Moreover, we present a compound refresh mode consisting of full and incremental refresh to guarantee data correctness in changing business logic as well as timely data migration.
  • Keywords
    Business; Compounds; Data warehouses; Databases; Engines; Fault tolerance; Fault tolerant systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Technology (ICIST), 2013 International Conference on
  • Conference_Location
    Yangzhou
  • Print_ISBN
    978-1-4673-5137-9
  • Type

    conf

  • DOI
    10.1109/ICIST.2013.6747594
  • Filename
    6747594