DocumentCode
458733
Title
Mining Association Rules from a Collection of XML Documents using Cross Filtering Algorithm
Author
Shin, Jun ; Paik, Juryon ; Kim, Ungmo
Author_Institution
Dept. of Comput. Eng., Sungkyunkwan Univ., Suwon
Volume
1
fYear
2006
fDate
9-11 Nov. 2006
Firstpage
120
Lastpage
126
Abstract
Since numerous data have been represented and exchanged by XML, the ability to extract useful knowledge from XML data is needed. There are several attempts to mine association rules from XML data. However, they mostly rely on legacy relational database with an XML interface so that efficiency and simplicity are challenging issue. In this paper, HILoP (hierarchical layered structure of PairSet) is introduced. The use of this data structure prevent from multiple XML data scans to mine association rules from a collection of XML documents. Also, cross filtering algorithm is introduced to mine frequent patterns, the algorithm reduces the number of candidate set. The performance evaluation result shows that this mechanism is powerful enough to represent both simple and complex structured association relationships inherent in XML data
Keywords
XML; data mining; data structures; relational databases; software maintenance; XML documents; cross filtering algorithm; data structure; hierarchical layered structure; legacy relational database; mining association rules; Association rules; Computer crashes; Data engineering; Data mining; Data structures; Database languages; Filtering algorithms; Knowledge engineering; Relational databases; XML;
fLanguage
English
Publisher
ieee
Conference_Titel
Hybrid Information Technology, 2006. ICHIT '06. International Conference on
Conference_Location
Cheju Island
Print_ISBN
0-7695-2674-8
Type
conf
DOI
10.1109/ICHIT.2006.253475
Filename
4021078
Link To Document