Title :
Dynamic timeout-based a session identification algorithm
Author :
Xinhua, He ; Qiong, Wang
Author_Institution :
Dept. of Inf., Acad. of Armored Force Eng., Beijing, China
Abstract :
In order to improve the accuracy of data preprocessing in Web log mining, the basic procedure of data preprocessing is introduced firstly in this paper. Then the traditional session identification algorithm is fully analyzed, on the basis of which, a session identification algorithm based on dynamic timeout is presented. At the beginning of the algorithm, the initial timeout is computed for each page according to the statistical result, combining with the importance degree of page; then during the procedure of session identification, timeout is dynamically adjusted, and user sessions is determined judging by the dynamic timeout. Comparing experiment shows that the algorithm proposed can obtain a better performance on session identification.
Keywords :
Web sites; data mining; Web log mining; data preprocessing; dynamic timeout; session identification algorithm; Algorithm design and analysis; Cleaning; Data mining; Data preprocessing; Heuristic algorithms; IP networks; Topology; Web log mining; dynamic timeout; session;
Conference_Titel :
Electric Information and Control Engineering (ICEICE), 2011 International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-8036-4
DOI :
10.1109/ICEICE.2011.5777587