• DocumentCode
    442233
  • Title

    CommonCube-based conceptual modeling of ETL processes

  • Author

    Li, Zehai ; Sun, Jigui ; Yu, Haihong ; Zhang, Jian

  • Author_Institution
    Coll. of Comput. Sci. & Technol., Jilin Univ., Changchun, China
  • Volume
    1
  • fYear
    2005
  • fDate
    26-29 June 2005
  • Firstpage
    131
  • Abstract
    ETL tools are responsible for extracting data from sources, cleansing it, and loading it into a target data warehouse. However, the design and development of ETL processes are currently performed in an in-house fashion and need uniform methodological foundations. In this paper, we propose a novel conceptual model for ETL processes. We employ CommonCubes to represent the cubes in a target data warehouse. CommonCubes free the design of ETL processes from overdependence on the physical schema of the target data warehouse and let designers devote more effort to data transformation than to data loading. Based on the constraint functions on source attributes and the transforming operations on target attributes, we define ETL mappings to capture the semantics of the various relationship cardinalities between source attributes and target attributes, which provides a good basis for the design of ETL processes.
  • Keywords
    data mining; data models; data warehouses; CommonCube-based conceptual modeling; ETL process; constraint functions; cube representation; data extraction; data transformation; data warehouse; extraction transformation loading tools; semantics capture; source attributes; Cleaning; Computer science; Computer science education; Costs; Data mining; Data warehouses; Educational technology; Knowledge engineering; Laboratories; Process design
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Control and Automation, 2005. ICCA '05. International Conference on
  • Print_ISBN
    0-7803-9137-3
  • Type

    conf

  • DOI
    10.1109/ICCA.2005.1528104
  • Filename
    1528104