Title :
Web Mining of Relations from XML and Construct Database Schema
Author :
Zhou, Xu ; Pan, Xuezeng ; Ren, Yu
Author_Institution :
Coll. of Comput. Sci., Zhejiang Univ., Hangzhou
fDate :
Nov. 28 2006-Dec. 1 2006
Abstract :
Increasing amount of commercial data is presented in XML format for exchanging or publishing on the Web. It is emerging as a new standard for information representation and exchanging over the Internet. How to retrieve valuable information from XML documents on the Web is a new challenge to data mining research. Compared with relational database, XML data in documents is stored as file with tree logical structure inside, it results in lower efficiency and performance in directly querying data. So it is still necessary to transform data into database (warehouse) for data mining afterwards. In this paper, we present a scheme to analyze relation of elements in XML on the Web, and construct relational database schema based on the analysis. During the process, there would be a worthy accessory product - a glossary, which can facilitate the process of data mining warehouse designing and building.
Keywords :
Internet; XML; data mining; relational databases; Internet; Web mining; World Wide Web; XML data; XML documents; XML format; construct database schema; data mining warehouse; data querying; eXtensible Markup Language; information representation; information retrieval; relational database schema; tree logical structure; Buildings; Data mining; Information representation; Information retrieval; Internet; Publishing; Relational databases; Terminology; Web mining; XML;
Conference_Titel :
Computational Intelligence for Modelling, Control and Automation, 2006 and International Conference on Intelligent Agents, Web Technologies and Internet Commerce, International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-2731-0
DOI :
10.1109/CIMCA.2006.233