DocumentCode :
2422031
Title :
An intelligent WWW agent for similarity-based searching
Author :
Rose, Tony G. ; Wyard, Peter J.
Author_Institution :
Canon Res. Centre Europe, Guildford, UK
fYear :
1997
fDate :
35506
Firstpage :
42552
Lastpage :
42557
Abstract :
The paper describes the development of a WWW agent that uses similarity-based methods to search the Internet. The Internet Information Agent (IIA) works by analysing a sample of the type of text that is known to be of interest to the user. It then extracts a number of linguistic features and stores these as a feature vector that is used to describe the content of the document. This data is then used as input to a range of similarity metrics that allow the agent to compare new texts with the original and thereby acquire “more of the same”. The agent´s strengths lie in its use of a range of similarity metrics that are known to perform well over a wide variety of input. The agent has been tested across a range of input data and evaluated against a number of criteria. The results of this evaluation are described and the prospects for the ongoing development of the agent are discussed
Keywords :
software agents; Internet Information Agent; Internet searching; document content description; feature vector; input data; intelligent WWW agent; linguistic features; new text comparison; similarity metrics; similarity-based searching; text analysis;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Intelligent World Wide Web Agents (Digest No.: 1997/118), IEE Colloquium on
Conference_Location :
London
Type :
conf
DOI :
10.1049/ic:19970648
Filename :
637460
Link To Document :
بازگشت