DocumentCode :
1871631
Title :
Scalable web monitoring system
Author :
Opalinski, Andrzej ; Turek, Wojciech ; Cetnarowicz, Krzysztof
Author_Institution :
AGH Univ. of Sci. & Technol., Krakow, Poland
fYear :
2013
fDate :
8-11 Sept. 2013
Firstpage :
1273
Lastpage :
1279
Abstract :
Publicly available Web search engines suffer from several limitations that significantly reduce their usability in particular cases. The most important limitations are out-of-date information, a very simple query language, and a limited number of results. In many cases, Internet users are interested in finding new information that appears on a particular Web portal. In this paper, a system for monitoring Web sites is presented. The system can continuously analyze the content of specified Web pages using advanced text processing algorithms. It actively notifies the user when the required information is found in newly added content. It can be deployed on a single PC as well as on a cluster of computers, providing good scalability. The paper presents an abstract architecture of the system, details of the implementation, and the results of real-life experiments.
Keywords :
Internet; Web sites; portals; text analysis; Web pages; Web portal; Web search engines; abstract architecture; computer cluster; query language; scalable Web monitoring system; single PC; text processing algorithms; Crawlers; Detectors; Google; Monitoring; Pattern matching; Servers
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2013 Federated Conference on
Conference_Location :
Kraków
Type :
conf
Filename :
6644178