Title of article :
A proposed model for data warehouse ETL processes
Author/Authors :
El-Sappagh, Shaker H. Ali King Saud University - College of Science - Mathematics Department, Saudi Arabia , Hendawi, Abdeltawab M. Ahmed Cairo University - Faculty of Computers and Information - Information Systems Department, Egypt , El Bastawissy, Ali Hamed Cairo University - Faculty of Computers and Information, - Information Systems Department, Egypt
From page :
91
To page :
104
Abstract :
Extraction–transformation–loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, its cleansing, customization, reformatting, integration, and insertion into a data warehouse. Building the ETL process is potentially one of the biggest tasks of building a warehouse; it is complex, time consuming, and consume most of data warehouse project’s implementation efforts, costs, and resources. Building a data warehouse requires focusing closely on understanding three main areas: the source area, the destination area, and the mapping area (ETL processes). The source area has standard models such as entity relationship diagram, and the destination area has standard models such as star schema, but the mapping area has not a standard model till now. In spite of the importance of ETL processes, little research has been done in this area due to its complexity. There is a clear lack of a standard model that can be used to represent the ETL scenarios. In this paper we will try to navigate through the efforts done to conceptualize the ETL processes. Research in the field of modeling ETL processes can be categorized into three main approaches: Modeling based on mapping expressions and guidelines, modeling based on conceptual constructs, and modeling based on UML environment. These projects try to represent the main mapping activities at the conceptual level. Due to the variation and differences between the proposed solutions for the conceptual design of ETL processes and due to their limitations, this paper also will propose a model for conceptual design of ETL processes. The proposed model is built upon the enhancement of the models in the previous models to support some missing mapping features.
Keywords :
Data warehouse , ETL processes , Database , Data mart , OLAP , Conceptual modeling
Journal title :
Journal Of King Saud University - Computer an‎d Information Sciences
Journal title :
Journal Of King Saud University - Computer an‎d Information Sciences
Record number :
2609724
Link To Document :
بازگشت