Title :
CommonCube-based conceptual modeling of ETL processes
Author :
Li, Zehai ; Sun, Jigui ; Yu, Haihong ; Zhang, Jian
Author_Institution :
Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
Abstract :
ETL tools are responsible for extracting data from sources, cleansing it, and loading it into a target data warehouse. At present, however, ETL processes are designed and developed in an in-house fashion and lack a uniform methodological foundation. In this paper, we propose a novel conceptual model for ETL processes. We employ CommonCubes to represent the cubes in a target data warehouse. CommonCubes free the design of ETL processes from overdependence on the physical schema of the target data warehouse and let designers devote more effort to data transformation than to data loading. Based on the constraint functions on source attributes and the transforming operations on target attributes, we define ETL mappings to capture the semantics of various relationship cardinalities between source attributes and target attributes, which provides a good basis for the design of ETL processes.
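As a rough illustration of the mapping idea summarized in the abstract, the sketch below pairs a constraint function on source attributes with a transforming operation that yields a target attribute. It is not the paper's formalism: the names `ETLMapping`, `apply_mappings`, and all attribute names are hypothetical, and the rule that a row is loaded only when every constraint holds is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


@dataclass
class ETLMapping:
    """Hypothetical mapping from source attributes to one target attribute."""
    sources: List[str]                  # source attribute names
    target: str                         # target attribute name
    constraint: Callable[[Row], bool]   # constraint function on the source attributes
    transform: Callable[[Row], Any]     # transforming operation producing the target value

    def cardinality(self) -> str:
        # Number of source attributes feeding a single target attribute.
        return f"{len(self.sources)}:1"


def apply_mappings(rows: List[Row], mappings: List[ETLMapping]) -> List[Row]:
    """Keep a source row only if every constraint holds (an assumption),
    then build the target row from the transforming operations."""
    loaded = []
    for row in rows:
        if all(m.constraint(row) for m in mappings):
            loaded.append({m.target: m.transform(row) for m in mappings})
    return loaded


# Two source attributes feed one target attribute (2:1).
name_mapping = ETLMapping(
    sources=["first", "last"],
    target="full_name",
    constraint=lambda r: bool(r["first"]) and bool(r["last"]),
    transform=lambda r: f"{r['first']} {r['last']}",
)

# One source attribute feeds one target attribute (1:1).
amount_mapping = ETLMapping(
    sources=["cents"],
    target="amount",
    constraint=lambda r: r["cents"] >= 0,
    transform=lambda r: r["cents"] / 100,
)

source_rows = [
    {"first": "Ada", "last": "Lovelace", "cents": 1250},
    {"first": "Bob", "last": "", "cents": -5},  # violates both constraints
]

loaded = apply_mappings(source_rows, [name_mapping, amount_mapping])
```

Here the mappings are written against target attribute names only, not a physical warehouse schema, which loosely mirrors the abstract's point about keeping the design focused on transformation rather than loading.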
Keywords :
data mining; data models; data warehouses; CommonCube-based conceptual modeling; ETL process; constraint functions; cube representation; data extraction; data transformation; data warehouse; extraction transformation loading tools; semantics capture; source attributes; Cleaning; Computer science; Computer science education; Costs; Data mining; Data warehouses; Educational technology; Knowledge engineering; Laboratories; Process design;
Conference_Title :
Control and Automation, 2005. ICCA '05. International Conference on
Print_ISBN :
0-7803-9137-3
DOI :
10.1109/ICCA.2005.1528104