DocumentCode :
3190214
Title :
Delta view generation for incremental loading of large dimensions in a data warehouse
Author :
Mekterovic, Igor ; Brkic, Ljiljana
Author_Institution :
Fac. of Electr. Eng. & Comput., Univ. of Zagreb, Zagreb, Croatia
fYear :
2015
fDate :
25-29 May 2015
Firstpage :
1417
Lastpage :
1422
Abstract :
Incremental load is the preferred approach in efficient ETL processes. Fact tables are the ones who benefit the most from this approach, since they are large in terms of row count. For the sake of simplicity, dimension tables are often ignored and populated in a full reload manner. However, big dimensions (e.g. Client) can also have a significant impact on the ETL process and should also be considered for incremental load. Although they have much smaller cardinality than a typical fact table, it usually takes much more resources to calculate one dimension table row than to calculate one fact table row. Large dimension tables are based on multiple source tables, and it is not trivial to determine the changed records that should be considered for the incremental load because changes in any and all of underlying source tables must be considered. In this paper, we present an algorithm for the dimension´s delta view generation. Delta view for a dimension encompasses all its source tables and produces a set of keys (e.g. ClientIds) that should be incrementally processed. We have employed this approach in a real world project and have noticed a significant reduction in the loading time of big dimensions.
Keywords :
data handling; data warehouses; ETL process; data warehouse; delta view generation; dimension tables; fact tables; incremental loading; Business; Cities and towns; Data mining; Data warehouses; Databases; Loading;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technology, Electronics and Microelectronics (MIPRO), 2015 38th International Convention on
Conference_Location :
Opatija
Type :
conf
DOI :
10.1109/MIPRO.2015.7160496
Filename :
7160496
Link To Document :
بازگشت