Title :
The Construction of Transactions for Web Usage Mining
Author :
Li, Yan ; Feng, Bo-Qin
Author_Institution :
Sch. of Electron. & Inf. Eng., Xi´´an Jiaotong Univ., Xi´´an, China
Abstract :
A data preprocessing system for constructing the transactions in Web usage mining is presented. To implement transaction identification, the user sessions and the user access paths are extracted from the Web access log and missing information is appended. These tasks are accomplished with the application of the referer-based method, which is an effective solution to the problems introduced by using proxy servers, local caching and firewall. Meanwhile, the reference length of accessed pages is calculated with the consideration of the time spent on data transfer over Internet. Then two kinds of transactions are defined, i.e. travel-path transactions and content-only transactions. These two kinds of transactions are constructed by the maximal forward references (MFR) algorithm and the reference length (RL) algorithm, respectively. As verified by practical Web access log, it is shown that the transactions can be efficiently identified while the reliability of the original Web access data is obviously improved for the further researches.
Keywords :
Internet; data mining; Internet; Web access log; Web usage mining; data transfer; firewall; local caching; maximal forward references algorithm; missing information; proxy servers; reference length algorithm; referer-based method; transaction identification; user access paths; user sessions; Cleaning; Computational intelligence; Computer science; Data engineering; Data mining; Data preprocessing; Internet; Privacy; System testing; Topology; data preprocessing; path completion; transaction identification; user session identification; web usage mining;
Conference_Titel :
Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-0-7695-3645-3
DOI :
10.1109/CINC.2009.101