Title :
Modelling ETL Conciliation Tasks Using Relational Algebra Operators
Author :
Santos, Vasco ; Belo, Orlando
Author_Institution :
Sch. of Manage. & Technol., CIICESI, Polytech. of Porto Felgueiras, Porto, Portugal
Abstract :
The design and development of a data warehousing system (DWS) tends to be an exceptional resource consuming project which in turn makes it a high risk/reward project. In order to minimize the risk, some design methodologies and tools are used along the several phases of the project. The Extract-Transform-Load (ETL) component is normally one of the most critical components of a DWS since it gathers, corrects and conforms data in order to be loaded into the Data Warehouse (DW). Data conciliation task tends to be a dull and manual intensive job that often deals with several heterogeneous sources which is critical to the correct representation of the enterprise´s information. The manual nature of this task makes it prone to errors and subject of intensive and successive monitoring. In this paper, we analyse some of the most common ETL tasks for data conciliation using a Relational Algebra approach, as an effort to standardize them for future use in a generic ETL environment. A slowly changed dimension scenario will be used to support the data conciliation modelling process designed for this work.
Keywords :
data warehouses; relational algebra; DWS; ETL conciliation task modelling; data conciliation modelling process; data warehousing system; design methodologies; design tools; enterprise information; extract-transform-load component; generic environment; heterogeneous sources; relational algebra operators; Algebra; Business; Data mining; Data models; Unified modeling language; Warehousing; Data Conciliation Tasks; Data Warehousing Systems; ETL Conceptual Modelling; Relational Algebra;
Conference_Titel :
Modelling Symposium (EMS), 2014 European
Print_ISBN :
978-1-4799-7411-5
DOI :
10.1109/EMS.2014.59