DocumentCode :
3009886
Title :
Improving Content-Oriented XML Retrieval by Exploiting Small Elements
Author :
Dopichaj, Philipp
Author_Institution :
Univ. of Kaiserslautern, Kaiserslautern
fYear :
2007
fDate :
3-5 July 2007
Firstpage :
68
Lastpage :
74
Abstract :
XML element retrieval aims at finding the best elements satisfying a user´s information need. Elements spanning only a few words, like titles or italicized phrases, are not in themselves useful results, but they can support the relevance of their enclosing elements. For example, if a section´s title contains the key words from the user´s query, the title itself is unlikely to be a useful result, but the section is very likely to be useful. This paper provides an overview of methods for exploiting small elements for better retrieval results, highlighting their respective advantages and disadvantages. Using the INEX testbed, we show that small elements can indeed provide useful retrieval hints, and we evaluate the trade-offs.
Keywords :
XML; content-based retrieval; information needs; INEX testbed; XML element retrieval; content-oriented XML retrieval; information need; user query; Content based retrieval; Context modeling; Engines; HTML; Information retrieval; Sections; Spatial databases; Testing; XML;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Databases, 2007. BNCOD '07. 24th British National Conference on
Conference_Location :
Glasgow
Print_ISBN :
0-7695-2912-7
Type :
conf
DOI :
10.1109/BNCOD.2007.12
Filename :
4269819
Link To Document :
بازگشت