DocumentCode :
162162
Title :
Identifying user clicks based on dependency graph
Author :
Jun Liu ; Cheng Fang ; Ansari, Nayeem
Author_Institution :
Sch. of Inf. & Commun. Eng., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear :
2014
fDate :
9-10 May 2014
Firstpage :
1
Lastpage :
5
Abstract :
Identifying user clicks from a large number of measured HTTP requests is the fundamental task for web usage mining, which is important for web administrators and developers. Nowadays, the prevalent parallel web browsing behavior caused by multi-tab web browsers renders accurate user click identification from massive requests a great challenge. In this paper, we propose a dependency graph model to describe the complicated web browsing behavior. Based on this model, we develop two algorithms to establish the dependency graph for measured requests, and identify user clicks by comparing their probabilities of being primary requests with a self-learned threshold. We evaluate our method with a large dataset collected from a real world mobile core network. The experimental results show that our method can achieve high accurate user clicks identification.
Keywords :
Internet; data mining; graph theory; hypermedia; parallel processing; transport protocols; HTTP; Web administrators; Web usage mining; dependency graph; mobile core network; multitab Web browsers; parallel Web browsing; self learned threshold; user click identification; Accuracy; Browsers; Cleaning; Data mining; Mathematical model; Mobile communication; Web pages;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Wireless and Optical Communication Conference (WOCC), 2014 23rd
Conference_Location :
Newark, NJ
Type :
conf
DOI :
10.1109/WOCC.2014.6839915
Filename :
6839915
Link To Document :
بازگشت