Title :
Semantic Formalization of Cross-Site User Browsing Behavior
Author :
Hoxha, J. ; Agarwal, Sankalp
Abstract :
Large amounts of data are being produced daily as detailed records of Web usage behavior, but the task of deriving actionable knowledge from them remains a challenge. Investigations of user browsing behavior at multiple websites, while more beneficial than studies restricted to a single site, still need to tackle the problems of information heterogeneity and mapping usage logs to meaningful events from the application domain. Focusing on the problem of modeling cross-site browsing behavior, we present a formalization approach based on a Web browsing Activity Model (WAM). We introduce a novel two-staged approach for the semantic enrichment of usage logs with domain knowledge, bringing together Semantic Web technologies and Machine Learning techniques. For learning the semantic types of logs, we present a supervised multi-class classification formulation, deploying structural Support Vector Machines with new sequential input features. We provide an implementation of these approaches and show the results of evaluation with real-world data.
Keywords :
Web sites; behavioural sciences computing; learning (artificial intelligence); pattern classification; semantic Web; support vector machines; Web browsing activity model; Web usage behavior; Websites; cross-site user browsing behavior; domain knowledge; information heterogeneity; machine learning techniques; real-world data evaluation; semantic Web technologies; semantic formalization; semantic-type logs; structural support vector machine; supervised multiclass classification formulation; two-staged approach; usage log mapping;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology (WI-IAT), 2012 IEEE/WIC/ACM International Conferences on
Conference_Location :
Macau
Print_ISBN :
978-1-4673-6057-9
DOI :
10.1109/WI-IAT.2012.232