DocumentCode :
3036208
Title :
An Evolutionary Model for Measuring Document Relevance in a Focused Web Spider
Author :
Lopez, Israel ; Alvarez-Carrillo, Pavel A. ; Fernandez-Gonzalez, Eduardo R.
Author_Institution :
Sch. of Inf., Autonomous Univ. of Sinaloa, Culiacan
fYear :
2008
fDate :
Sept. 30 2008-Oct. 3 2008
Firstpage :
177
Lastpage :
182
Abstract :
Exploring the Web in search of relevant information is a difficult task because of the vast number of documents it stores and the heterogeneity of those documents. Automated systems such as search engines help users cope with the size of the Web. However, the results produced by these systems usually contain documents from a large variety of topics with little or no relevance to the end user. In this work, we propose a model that can be used by a Web spider to selectively explore the Web for relevant documents. In this model, two criteria are used for assessing document relevance: content and structure. These two criteria are integrated in a fuzzy predicate that indicates the degree of relevance of a document with respect to a user-defined topic. The parameters of the proposed model are generated by a genetic algorithm that solves a bi-criteria optimization problem.
Keywords :
Internet; fuzzy set theory; genetic algorithms; information retrieval; search engines; bi-criteria optimization problem; document relevance measurement; evolutionary model; focused Web spider; fuzzy predicate; genetic algorithm; search engines; user-defined topic; Automotive engineering; Databases; Evolutionary computation; Genetic algorithms; Informatics; Information retrieval; Mechanical variables measurement; Robots; Search engines; Web pages; Evolutionary Algorithms; Information Retrieval; MCDA; Web Spider;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Electronics, Robotics and Automotive Mechanics Conference, 2008. CERMA '08
Conference_Location :
Morelos
Print_ISBN :
978-0-7695-3320-9
Type :
conf
DOI :
10.1109/CERMA.2008.28
Filename :
4641067