DocumentCode :
2725063
Title :
On Mining Dynamic Web ClickStreams for Frequent Traversal Sequences
Author :
Ren, Jiadong ; Zhang, Xiaojian ; Peng, Huili
Author_Institution :
Coll. of Inf. Sci. & Eng., Yanshan Univ., Qinhuangdao
fYear :
2007
fDate :
March 1 2007-April 5 2007
Firstpage :
477
Lastpage :
484
Abstract :
Although frequent traversal sequence (FTS) mining has been studied extensively in Web usage mining over the last decade, extending the technique to dynamic Web clickstreams remains challenging. The main difficulty is that existing false-positive methods control both memory consumption and output accuracy through a relaxation ratio r (r = e/s, where e is the error parameter and s is the specified minimum support). Increasing r saves memory and improves recall but degrades output precision; conversely, decreasing r yields more precise output but requires more storage. In this paper, upper and lower bounds are established to constrain r, a weighted harmonic average (WHA) of the two bounds is designed to adjust r, and a novel algorithm, FTS-Stream, is proposed to find FTS over a time-sensitive sliding window. Precision and recall can thus be maintained with the WHA-adjusted r. Our analysis and experiments show that FTS-Stream achieves high accuracy and requires less memory on dynamic Web clickstreams.
Keywords :
Internet; data mining; Web usage mining; dynamic Web ClickStreams mining; false-positive methods; frequent traversal sequences; time-sensitive sliding window; weighted harmonic average; Algorithm design and analysis; Computational intelligence; Data analysis; Data engineering; Data mining; Databases; Degradation; Educational institutions; Error correction; Information science;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0705-2
Type :
conf
DOI :
10.1109/CIDM.2007.368913
Filename :
4221337