DocumentCode
3069449
Title
The Design and Implementation of XML Semi-structured Data Extraction and Loading into the Data Warehouse
Author
Guohua, Yue ; Jingting, Wang
Author_Institution
Sch. of Comput. Sci. & Technol., Xi´´an Univ. of Sci. & Technol., Xi´´an, China
Volume
3
fYear
2010
fDate
16-18 July 2010
Firstpage
30
Lastpage
33
Abstract
By analyzing the characteristics of Semi-structured data along with the actual Book Return Data (BokeDataInfo.xml) in the Auto department chain sales as an example, DOM objects extract, transform and load into the data tables of current level of detail of the Data Warehouse. The paper based XML Semi-structured data has designed and implemented a Data Warehouse ETL tool which based on the Semi-structured data. Meanwhile, it also cleans up the defect which the loading of Data Warehouse data can not directly loaded and extracted the XML documents by commercial ETL tool, and it also fathoms the practical exploration for the problem of extracting and loading the semi-structured XML data into the current level of detail of the Data Warehouse.
Keywords
XML; data structures; data warehouses; ETL tool; XML document extraction; XML semistructured data extraction; data warehouse; document object model; Algorithm design and analysis; Business; Data mining; Data warehouses; Databases; Loading; XML; DOM; Date Warehouse; ETL tool; Semi-Structured Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology and Applications (IFITA), 2010 International Forum on
Conference_Location
Kunming
Print_ISBN
978-1-4244-7621-3
Electronic_ISBN
978-1-4244-7622-0
Type
conf
DOI
10.1109/IFITA.2010.265
Filename
5634728
Link To Document