Title :
Wrapping WWW information sources
Author_Institution :
Dept. of Comput. Sci. & Artificial Intelligence, Malta Univ., Malta
Abstract :
As information over the World-Wide Web (WWW) is proliferating rapidly and the demand to access it is also escalating, the need to efficiently and effectively make use of the various knowledge resources is highly imperative. Research to extract maximum benefit from the Internet is still in its infancy. Different research areas like data mining, information retrieval, machine learning and knowledge discovery, are giving the problem due attention and all seem to converge onto a common theme-wrappers. Wrappers provide access to heterogeneous information sources by converting or translating queries into source specific queries or commands. We discuss improvements on a distinctive approach, wrapper conduction, to automatically generate a wrapper once an information source has been identified. We will show how different information filtering systems we developed employ numerous Internet information sources in an attempt to exploit this knowledge base by automatically wrapping the source to safeguard the evolvability of the same system once new sources become available on the WWW. We examine and compare the performance these systems achieved when employing wrapper-conducted queries in contrast to tailored hand-coded ones; comparative results are presented
Keywords :
data mining; information resources; information retrieval; learning (artificial intelligence); WWW information sources; data mining; information retrieval; knowledge discovery; knowledge resources; machine learning; source specific queries; wrapper conduction; wrappers; Artificial intelligence; Computer science; Data mining; IP networks; Information retrieval; Internet; Metasearch; Search engines; World Wide Web; Wrapping;
Conference_Titel :
Database Engineering and Applications Symposium, 2000 International
Conference_Location :
Yokohama
Print_ISBN :
0-7695-0789-1
DOI :
10.1109/IDEAS.2000.880632