DocumentCode :
2500053
Title :
Crawler Detection: A Bayesian Approach
Author :
Stassopoulou, Athena ; Dikaiakos, Marios D.
Author_Institution :
Dept. of Comput. Sci., Intercollege, Nicosia
fYear :
2006
fDate :
26-28 Aug. 2006
Firstpage :
16
Lastpage :
16
Abstract :
In this paper, we introduce a probabilistic modeling approach for addressing the problem of Web robot detection from Web-server access logs. More specifically, we construct a Bayesian network that classifies automatically access-log sessions as being crawler- or human-induced, by combining various pieces of evidence proven to characterize crawler and human behavior. Our approach uses machine learning techniques to determine the parameters of the probabilistic model. We apply our method to real Web-server logs and obtain results that demonstrate the robustness and effectiveness of probabilistic reasoning for crawler detection
Keywords :
Internet; belief networks; inference mechanisms; learning (artificial intelligence); online front-ends; Bayesian approach; Web robot detection; Web-server access logs; crawler detection; human behavior; machine learning techniques; probabilistic modeling approach; probabilistic reasoning; Bayesian methods; Computer science; Crawlers; Humans; Machine learning; Navigation; Robotics and automation; Robots; Robustness; Web server;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Internet Surveillance and Protection, 2006. ICISP '06. International Conference on
Conference_Location :
Cote d´Azur
Print_ISBN :
0-7695-2649-7
Type :
conf
DOI :
10.1109/ICISP.2006.7
Filename :
1690400
Link To Document :
بازگشت