Title :
Adaptive Web Data Extraction Policies
Author :
Fiumara, Giacomo ; Marchi, Massimo ; Provetti, Alessandro
Author_Institution :
Univ. di Messina, Messina
Abstract :
Dynamo is a middleware that helps generate informative RSS feeds from legacy HTML Web sites. To produce timely and informative RSS feeds while remaining scalable, Dynamo requires careful tuning and customization of its polling policies. These policies have been evaluated against frequently updated news portals.
Keywords :
Internet; Web sites; middleware; text analysis; Dynamo; adaptive web data extraction policy; informative RSS feeds; legacy HTML Web sites; news portal; polling policy; Aggregates; Data mining; Feeds; Frequency estimation; HTML; Magnetohydrodynamic power generation; Middleware; Portals; Web pages; Web services
Conference_Title :
Eighth IEEE International Workshop on Policies for Distributed Systems and Networks (POLICY '07), 2007
Conference_Location :
Bologna
Print_ISBN :
0-7695-2767-1
DOI :
10.1109/POLICY.2007.4