Title :
Indexing of Reading Paths for a Structured Information Retrieval on the Web
Author_Institution :
Univ. de Lyon, St. Etienne
Abstract :
In this paper, we present a hyperdocument model taking into account the essential aspects of information on the Web: content, composition (logical structure) and non-linear reading (hypertext structure). We have developed a Structured Information Retrieval System (SIRS) based on this model. Its phases of indexing and querying are based on a ldquoreading pathsrdquo point of view of the Web: a Web site is considered as a set of potential reading paths, instead of a set of atomic and flat pages. We have developed an specific algorithm to index the reading paths. We present some experiments aiming at evaluating the interest of our indexing process of reading paths.
Keywords :
Internet; indexing; information retrieval; Web site; World Wide Web; hyperdocument model; hypertext structure; indexing process; nonlinear reading; querying; reading paths; structured information retrieval system; Content based retrieval; Context modeling; HTML; Indexing; Information retrieval; Intelligent agent; Intelligent structures; Search engines; Web pages; Web search; indexing; information retrieval; reading path; structure; web;
Conference_Titel :
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-0-7695-3496-1
DOI :
10.1109/WIIAT.2008.386