DocumentCode :
3228869
Title :
Active User-Based and Ontology-Based Web Log Data Preprocessing for Web Usage Mining
Author :
Khasawneh, Natheer ; Chan, Chien-Chung
Author_Institution :
Dept. of Comput. Eng., Jordan Univ. of Sci. & Technol., Irbid
fYear :
2006
fDate :
18-22 Dec. 2006
Firstpage :
325
Lastpage :
328
Abstract :
User identification and session identification are two major steps in preprocessing Web log data for Web usage mining. This paper introduces a fast active user-based user identification algorithm with time complexity O(n). The algorithm uses both an IP address and a finite users´ inactive time to identify different users in the Web log. Web site ontology is useful for identifying Web site structure and break points for browsing behavior. For session identification, we present an ontology-based method that utilizes the Web site structure and functionalities to identify different sessions
Keywords :
Internet; computational complexity; data mining; ontologies (artificial intelligence); Web usage mining; active user-based user identification algorithm; ontology-based Web log data preprocessing; session identification; time complexity; Application software; Cleaning; Computer science; Data engineering; Data preprocessing; Data security; HTML; Navigation; Ontologies; Web mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2747-7
Type :
conf
DOI :
10.1109/WI.2006.32
Filename :
4061387
Link To Document :
بازگشت