Title :
Automatic Pattern-Taxonomy Extraction for Web Mining
Author :
Wu, Sheng-Tang ; Li, Yuefeng ; Xu, Yue ; Pham, Binh ; Chen, Phoebe
Author_Institution :
Queensland University of Technology, Australia
Abstract :
In this paper, we propose a model for discovering frequent sequential patterns, phrases, which can be used as profile descriptors of documents. It is indubitable that we can obtain numerous phrases using data mining algorithms. However, it is difficult to use these phrases effectively for answering what users want. Therefore, we present a pattern taxonomy extraction model which performs the task of extracting descriptive frequent sequential patterns by pruning the meaningless ones. The model then is extended and tested by applying it to the information filtering system. The results of the experiment show that pattern-based methods outperform the keyword-based methods. The results also indicate that removal of meaningless patterns not only reduces the cost of computation but also improves the effectiveness of the system.
Keywords :
Association rules; Data communication; Data mining; Data processing; Information filtering; Information filters; Software engineering; Taxonomy; Text mining; Web mining;
Conference_Titel :
Web Intelligence, 2004. WI 2004. Proceedings. IEEE/WIC/ACM International Conference on
Print_ISBN :
0-7695-2100-2
DOI :
10.1109/WI.2004.10132