Title :
An efficient approach for the maintenance of path traversal patterns
Author :
Yen, Show-Jane ; Lee, Yue-Shi ; Cho, Chung-Wen
Author_Institution :
Dept. of CSIE, Fu Jen Catholic Univ., Taiwan
Abstract :
Mining frequent traversal patterns is to discover the consecutive reference paths traversed by a sufficient number of users from Web logs. The previous approaches for mining frequent traversal patterns need to repeatedly scan the traversal paths and take a large amount of computation time to find frequent traversal patterns. However, the discovered frequent traversal patterns may become invalid or inappropriate when the databases are updated. We propose an incremental updating technique to maintain the discovered frequent traversal patterns when the user sequences are inserted into or the database. Our approach partitions the database into some segments and scans the database segment by segment. For each segment scan, the candidate traversal sequences that cannot be frequent traversal sequences can be pruned and the frequent traversal sequences can be found out earlier. Besides, the number of database scans can be significantly reduced because some information can be computed by our approach. The experimental results show that our algorithms are more efficient than other algorithms for the maintenance of mining frequent traversal patterns.
Keywords :
data mining; pattern recognition; very large databases; Web logs; candidate traversal sequences; consecutive reference path discover; database partitioning; database segment scanning; frequent traversal pattern mining; frequent traversal sequences; incremental updating technique; path traversal pattern maintenance; user sequences; Computer science; Data mining; Databases; Decision making; Information analysis; Partitioning algorithms; Web mining; Web pages;
Conference_Titel :
e-Technology, e-Commerce and e-Service, 2004. EEE '04. 2004 IEEE International Conference on
Print_ISBN :
0-7695-2073-1
DOI :
10.1109/EEE.2004.1287311