Title :
Research on Path Completion Technique in Web Usage Mining
Author :
Li, Yan ; Feng, BoQin ; Mao, Qinjiao
Abstract :
An implementation of data preprocessing system for Web usage mining and the details of algorithm for path completion are presented. After user session identification, the missing pages in user access paths are appended by using the referer-based method which is an effective solution to the problems introduced by using proxy servers and local caching. The reference length of pages in complete path is modified by considering the average reference length of auxiliary pages which is estimated in advance through the maximal forward references and the reference length algorithms. As verified by practical Web access log, the proposed path completion algorithm efficiently appends the lost information and improves the reliability of access data for further Web usage mining calculations.
Keywords :
Internet; data mining; Web usage mining; data preprocessing system; maximal forward references; path completion; reference length algorithms; referer-based method; user session identification; Cleaning; Computer science; Data engineering; Data mining; Data preprocessing; Impedance; Network servers; Research and development; Telecommunication traffic; Web sites; Data Preprocessing; Path Completion; Web Usage Mining;
Conference_Titel :
Computer Science and Computational Technology, 2008. ISCSCT '08. International Symposium on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3746-7
DOI :
10.1109/ISCSCT.2008.151