Title :
A Double Algorithm of Web Usage Mining Based on Sequence Number
Author :
Fang, Gang ; Wang, Jia-Le ; Ying, Hong ; Xiong, Jiang
Author_Institution :
Chongqing Three Gorges Univ., Chongqing, China
Abstract :
Web usage mining is an application of data mining technology to mining the data of the Web server log files. It can discover these session patterns of user and some kinds of correlations between these Web pages. Web usage mining provides the support for the Web site design, providing personalization server and other business making decision. There are some session patterns saved in Web server log files, page attribute of which is Boolean quantity. In order to improve efficiency of presented algorithms and reduce the time of scanning database, and so aiming to these characters, this paper proposes a double algorithm of Web usage mining based on sequence number, which is suitable for mining any session patterns. The algorithm turns session pattern of user into binary, and then uses up and down search strategy to double generate candidate frequent itemsets. The algorithm computes support by sequence number dimension in order to scan once session pattern of user, which is different from traditional double search mining algorithm. And the efficiency of Web usage mining is efficiently improved because of this way. The experiment indicates that the efficiency is faster and more efficient than presented similar algorithms.
Keywords :
Internet; Web design; data mining; decision making; file servers; Boolean quantity; Web pages; Web server log files; Web site design; Web usage mining double algorithm; business decision making; candidate frequent itemsets; data mining technology; database scanning; down search strategy; personalization server; sequence number; up down search strategy; Data mining; Databases; Itemsets; Mathematical model; Uniform resource locators; Web design; Web mining; Web pages; Web server; Web sites;
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5363879