Title :
The Study on the Preprocessing in Web Log Mining
Author :
Shu-yue, Ma ; Wen-cai, Liu ; Shuo, Wang
Author_Institution :
Coll. of Inf., Jiu Jiang Univ., Jiu Jiang, China
Abstract :
According to the Web log mining, the site administrators can control the network traffic and understand the user access modes. Then they can further improve the performance of Web systems and optimize the system design of Web sites by using these information. However, the Web log data doesn´t perform the data mining directly in most cases because of the messy and redundant content and other reasons. This paper analyzes the data pre-processing on Web log in order to meet the needs of data mining. At the same time, it also puts forward some reasonable processing means.
Keywords :
Internet; data mining; Web log mining; Web site; data mining; data preprocessing; user access mode; Cleaning; Data mining; Educational institutions; IP networks; Web servers; Web log mining; data cleaning; transaction identification segmentation; user identification;
Conference_Titel :
Knowledge Acquisition and Modeling (KAM), 2011 Fourth International Symposium on
Conference_Location :
Sanya
Print_ISBN :
978-1-4577-1788-8
DOI :
10.1109/KAM.2011.90