Title :
On Modelling and Synthetically Generating Web Usage Data
Author :
Hofgesang, Peter I. ; Patist, Jan Peter
Author_Institution :
Dept. of Comput. Sci., VU Univ. Amsterdam, Amsterdam
Abstract :
While the whole public Web is a potential source for Web content and Web structure mining, the actual usage information, which is essential for Web usage mining (WUM), is kept hidden by the Web servers of hosted Web sites. Furthermore, only a handful of poorly described Web access datasets are publicly available. On the one hand, the lack of public datasets hampers WUM research; on the other hand, online services demand advanced techniques, e.g. to profile their customers and personalize their Web-based services. In this paper we propose a methodology for building synthetic Web usage data generators, based on the knowledge gained from an extensive analysis of five real-world Web usage datasets.
Keywords :
Internet; data mining; Web access datasets; Web based services; Web content; Web structure mining; Web usage datasets; Web usage mining; public Web; public datasets; synthetic Web usage data generators; Computational modeling; Data mining; Intelligent agent; Intelligent structures; System performance; System testing; Telecommunication traffic; Traffic control; Web mining; Web server; data generator; web usage mining
Conference_Title :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.384