Title :
The Design and Implementation of XML Semi-structured Data Extraction and Loading into the Data Warehouse
Author :
Guohua, Yue ; Jingting, Wang
Author_Institution :
Sch. of Comput. Sci. & Technol., Xi'an Univ. of Sci. & Technol., Xi'an, China
Abstract :
This paper analyzes the characteristics of semi-structured data, taking the actual Book Return Data (BokeDataInfo.xml) from auto department chain sales as an example. DOM objects are used to extract, transform, and load the XML data into the current-level-of-detail data tables of the Data Warehouse. On this basis, the paper designs and implements a Data Warehouse ETL tool for XML semi-structured data. The tool addresses a shortcoming of commercial ETL tools, which cannot directly extract XML documents and load them into the Data Warehouse, and it offers a practical exploration of the problem of extracting and loading semi-structured XML data into the current level of detail of the Data Warehouse.
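The DOM-based extract, transform, and load flow named in the abstract might look roughly like the following minimal Python sketch. It is not the authors' tool: the abstract does not give the layout of BokeDataInfo.xml or the warehouse schema, so the element names (`BookReturns`, `Return`, `Store`, `Amount`), the table `return_detail`, and the use of an in-memory SQLite database as a stand-in for the Data Warehouse are all assumptions made for illustration.

```python
import sqlite3
from xml.dom.minidom import parseString

# Hypothetical sample; the real BokeDataInfo.xml layout is not given in the abstract.
SAMPLE_XML = """<BookReturns>
  <Return id="R001">
    <Store>S01</Store>
    <Amount> 120.50 </Amount>
  </Return>
  <Return id="R002">
    <Store>S02</Store>
  </Return>
</BookReturns>"""


def text_of(parent, tag, default=""):
    """Return the stripped text of the first <tag> descendant, or default if absent."""
    nodes = parent.getElementsByTagName(tag)
    if not nodes or nodes[0].firstChild is None:
        return default
    return nodes[0].firstChild.data.strip()


def extract_rows(xml_text):
    """Extract and transform: walk the DOM and flatten each record into a tuple."""
    dom = parseString(xml_text)
    rows = []
    for rec in dom.getElementsByTagName("Return"):
        # Semi-structured data: <Amount> may be missing, so fall back to 0.
        amount = float(text_of(rec, "Amount", "0"))
        rows.append((rec.getAttribute("id"), text_of(rec, "Store"), amount))
    return rows


def load_rows(conn, rows):
    """Load: insert the flattened rows into a (hypothetical) detail-level table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS return_detail "
        "(return_id TEXT PRIMARY KEY, store TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO return_detail VALUES (?, ?, ?)", rows)
    conn.commit()


# In-memory SQLite stands in for the Data Warehouse in this sketch.
conn = sqlite3.connect(":memory:")
load_rows(conn, extract_rows(SAMPLE_XML))
```

The transform step here is deliberately small: it trims whitespace, converts the amount to a number, and supplies a default for an optional element, which is the kind of irregularity that distinguishes semi-structured XML from fixed relational rows.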
Keywords :
XML; data structures; data warehouses; ETL tool; XML document extraction; XML semistructured data extraction; data warehouse; document object model; Algorithm design and analysis; Business; Data mining; Data warehouses; Databases; Loading; DOM; Data Warehouse; Semi-Structured Data
Conference_Title :
Information Technology and Applications (IFITA), 2010 International Forum on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-7621-3
Electronic_ISBN :
978-1-4244-7622-0
DOI :
10.1109/IFITA.2010.265