DocumentCode :
1826697
Title :
A comparison of Web robot and human requests
Author :
Doran, Derek ; Morillo, Kevin ; Gokhale, S.S.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Connecticut, Storrs, CT, USA
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1374
Lastpage :
1380
Abstract :
Sophisticated Web robots sport a wide variety of functionality and visiting characteristics, and account for a significant percentage of the requests serviced by a Web server. Unlike human clients, who retrieve information from a site by navigating links and ignoring irrelevant information, Web robots may collect many different types of resources and employ varying navigation strategies to find the knowledge they desire on the site. Thus, the resource request patterns of their visits are unpredictable and cannot be inferred from our knowledge of human request patterns. In this paper, we analyze the types of resources requested by Web robots using recent Web logs from an academic Web server. We study the distribution of response sizes and response codes, the types of resources requested, and the popularity of resources for requests from Web robots. Throughout, we contrast our findings against human resource request patterns. We find reasons to suggest that robots severely handicap the ability of Web server caches to operate with high performance.
Keywords :
Internet; cache storage; file servers; information retrieval; Web logs; Web robot; Web server caches; academic Web server; human clients; human request patterns; human resource request patterns; information retrieve; resource request patterns; response codes; response sizes; visiting characteristics; Browsers; HTML; Market research; Navigation; Robots; Web servers;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advances in Social Networks Analysis and Mining (ASONAM), 2013 IEEE/ACM International Conference on
Conference_Location :
Niagara Falls, ON
Type :
conf
Filename :
6785880