• DocumentCode
    2384500
  • Title

    A Domain Ontology Approach in the ETL Process of Data Warehousing

  • Author

    Jiang, Lihong ; Cai, Hongming ; Xu, Boyi

  • Author_Institution
    Sch. of Software, Shanghai JiaoTong Univ., Shanghai, China
  • fYear
    2010
  • fDate
    10-12 Nov. 2010
  • Firstpage
    30
  • Lastpage
    35
  • Abstract
    Extract-Transform-Loading (ETL) tools integrate data from source side to target in building data warehouse. However data structure and semantic heterogeneity exits widely in the enterprise information systems. On the purpose of eliminate data heterogeneity so as to construct data warehouse, this paper introduces domain ontology into ETL process of finding the data sources, defining the rules of data transformation, and eliminating the heterogeneity. In this method, the domain ontology is embedded in the metadata of the data warehouse. Hence, the data record could be mapped from data bases to ontology classes of Web Ontology Language (OWL). As result, the accessing of information resources could be done more efficiently. The method is testing in a hospital data warehouse project, and the result shows that ontology method plays an important role in the process of data integration by providing common descriptions of the concepts and relationships of data items, and medical domain ontology in the ETL process is of practical feasibility.
  • Keywords
    business data processing; data warehouses; knowledge representation languages; ontologies (artificial intelligence); ETL process; OWL; Web ontology language; data structure; data warehouse metadata; domain ontology; enterprise information system; extract-transform-loading tool; semantic heterogeneity; Business; Data mining; Data models; Data warehouses; Databases; Ontologies; Semantics; Domain Ontology; ETL; Hospital Data Warehouse;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    e-Business Engineering (ICEBE), 2010 IEEE 7th International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4244-8386-0
  • Electronic_ISBN
    978-0-7695-4227-0
  • Type

    conf

  • DOI
    10.1109/ICEBE.2010.36
  • Filename
    5704295