DocumentCode :
3069449
Title :
The Design and Implementation of XML Semi-structured Data Extraction and Loading into the Data Warehouse
Author :
Guohua, Yue ; Jingting, Wang
Author_Institution :
Sch. of Comput. Sci. & Technol., Xi´´an Univ. of Sci. & Technol., Xi´´an, China
Volume :
3
fYear :
2010
fDate :
16-18 July 2010
Firstpage :
30
Lastpage :
33
Abstract :
By analyzing the characteristics of Semi-structured data along with the actual Book Return Data (BokeDataInfo.xml) in the Auto department chain sales as an example, DOM objects extract, transform and load into the data tables of current level of detail of the Data Warehouse. The paper based XML Semi-structured data has designed and implemented a Data Warehouse ETL tool which based on the Semi-structured data. Meanwhile, it also cleans up the defect which the loading of Data Warehouse data can not directly loaded and extracted the XML documents by commercial ETL tool, and it also fathoms the practical exploration for the problem of extracting and loading the semi-structured XML data into the current level of detail of the Data Warehouse.
Keywords :
XML; data structures; data warehouses; ETL tool; XML document extraction; XML semistructured data extraction; data warehouse; document object model; Algorithm design and analysis; Business; Data mining; Data warehouses; Databases; Loading; XML; DOM; Date Warehouse; ETL tool; Semi-Structured Data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology and Applications (IFITA), 2010 International Forum on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-7621-3
Electronic_ISBN :
978-1-4244-7622-0
Type :
conf
DOI :
10.1109/IFITA.2010.265
Filename :
5634728
Link To Document :
بازگشت