DocumentCode :
458733
Title :
Mining Association Rules from a Collection of XML Documents using Cross Filtering Algorithm
Author :
Shin, Jun ; Paik, Juryon ; Kim, Ungmo
Author_Institution :
Dept. of Comput. Eng., Sungkyunkwan Univ., Suwon
Volume :
1
fYear :
2006
fDate :
9-11 Nov. 2006
Firstpage :
120
Lastpage :
126
Abstract :
Since numerous data have been represented and exchanged by XML, the ability to extract useful knowledge from XML data is needed. There are several attempts to mine association rules from XML data. However, they mostly rely on legacy relational database with an XML interface so that efficiency and simplicity are challenging issue. In this paper, HILoP (hierarchical layered structure of PairSet) is introduced. The use of this data structure prevent from multiple XML data scans to mine association rules from a collection of XML documents. Also, cross filtering algorithm is introduced to mine frequent patterns, the algorithm reduces the number of candidate set. The performance evaluation result shows that this mechanism is powerful enough to represent both simple and complex structured association relationships inherent in XML data
Keywords :
XML; data mining; data structures; relational databases; software maintenance; XML documents; cross filtering algorithm; data structure; hierarchical layered structure; legacy relational database; mining association rules; Association rules; Computer crashes; Data engineering; Data mining; Data structures; Database languages; Filtering algorithms; Knowledge engineering; Relational databases; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Hybrid Information Technology, 2006. ICHIT '06. International Conference on
Conference_Location :
Cheju Island
Print_ISBN :
0-7695-2674-8
Type :
conf
DOI :
10.1109/ICHIT.2006.253475
Filename :
4021078
Link To Document :
بازگشت