• DocumentCode
    3069449
  • Title

    The Design and Implementation of XML Semi-structured Data Extraction and Loading into the Data Warehouse

  • Author

    Guohua, Yue ; Jingting, Wang

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Xi´´an Univ. of Sci. & Technol., Xi´´an, China
  • Volume
    3
  • fYear
    2010
  • fDate
    16-18 July 2010
  • Firstpage
    30
  • Lastpage
    33
  • Abstract
    By analyzing the characteristics of Semi-structured data along with the actual Book Return Data (BokeDataInfo.xml) in the Auto department chain sales as an example, DOM objects extract, transform and load into the data tables of current level of detail of the Data Warehouse. The paper based XML Semi-structured data has designed and implemented a Data Warehouse ETL tool which based on the Semi-structured data. Meanwhile, it also cleans up the defect which the loading of Data Warehouse data can not directly loaded and extracted the XML documents by commercial ETL tool, and it also fathoms the practical exploration for the problem of extracting and loading the semi-structured XML data into the current level of detail of the Data Warehouse.
  • Keywords
    XML; data structures; data warehouses; ETL tool; XML document extraction; XML semistructured data extraction; data warehouse; document object model; Algorithm design and analysis; Business; Data mining; Data warehouses; Databases; Loading; XML; DOM; Date Warehouse; ETL tool; Semi-Structured Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology and Applications (IFITA), 2010 International Forum on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-7621-3
  • Electronic_ISBN
    978-1-4244-7622-0
  • Type

    conf

  • DOI
    10.1109/IFITA.2010.265
  • Filename
    5634728