DocumentCode :
3271052
Title :
The Construction of Transactions for Web Usage Mining
Author :
Li, Yan ; Feng, Bo-Qin
Author_Institution :
Sch. of Electron. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an, China
Volume :
1
fYear :
2009
fDate :
6-7 June 2009
Firstpage :
121
Lastpage :
124
Abstract :
A data preprocessing system for constructing the transactions in Web usage mining is presented. To implement transaction identification, the user sessions and the user access paths are extracted from the Web access log and missing information is appended. These tasks are accomplished with the application of the referer-based method, which is an effective solution to the problems introduced by using proxy servers, local caching and firewall. Meanwhile, the reference length of accessed pages is calculated with the consideration of the time spent on data transfer over Internet. Then two kinds of transactions are defined, i.e. travel-path transactions and content-only transactions. These two kinds of transactions are constructed by the maximal forward references (MFR) algorithm and the reference length (RL) algorithm, respectively. As verified by practical Web access log, it is shown that the transactions can be efficiently identified while the reliability of the original Web access data is obviously improved for the further researches.
Keywords :
Internet; data mining; Internet; Web access log; Web usage mining; data transfer; firewall; local caching; maximal forward references algorithm; missing information; proxy servers; reference length algorithm; referer-based method; transaction identification; user access paths; user sessions; Cleaning; Computational intelligence; Computer science; Data engineering; Data mining; Data preprocessing; Internet; Privacy; System testing; Topology; data preprocessing; path completion; transaction identification; user session identification; web usage mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3645-3
Type :
conf
DOI :
10.1109/CINC.2009.101
Filename :
5231340
Link To Document :
بازگشت