Title :
Methodology for Preprocessing and Evaluating the Time Spent on Web Pages
Author :
Hofgesang, Peter I.
Author_Institution :
Dept. Comput. Sci., Vrije Univ. Amsterdam
Abstract :
On the Web, the intention of a user is mostly hidden. To approximate user intention and characterise user behaviour, research in Web usage mining mainly exploits two types of information: the order and the frequency of visited pages. However, several studies in information retrieval and human-computer interaction have suggested that the time spent on Web pages (TSP) is an important measure of user intention and page relevance. In our paper we provide a methodology to preprocess the TSP. In addition, we present a real-world testbed that provides an unbiased environment and representative, real-world data in specific Web domains. The environment can be used to evaluate user interest indicators and importance measures, to validate clustering algorithms, and for a broad selection of other validation problems. As a case study, we define a testbed on online retail shop data and evaluate, among other things, the relevance of TSP.
Keywords :
Internet; data mining; human factors; Web pages; Web usage mining; human-computer interaction; information retrieval; user behaviour; user intention; Clustering algorithms; Computer science; Electronic learning; Frequency; Navigation; Testing; Time measurement
Conference_Title :
Web Intelligence, 2006. WI 2006. IEEE/WIC/ACM International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2747-7
DOI :
10.1109/WI.2006.116