Title of article :
Content-Based Retrieval using Heuristic Search
Author/Authors :
Papadias، Dimitris نويسنده , , Mantzourogiannis، Marios نويسنده , , Kalnis، Panos نويسنده , , Mamoulis، Nikos نويسنده , , Ahmad، Ishfaq نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 1999
Pages :
-167
From page :
168
To page :
0
Abstract :
In this paper we examine the question of query parsing for World Wide Web queries and present a novel method for phrase recognition and expansion. Given a training corpus of approximately 16 million Web queries and a handwritten context-free grammar, the EM algorithm is used to estimate the parameters of a probabilistic context-free grammar (PCFG) with a system developed by Carroll [5]. We use the PCFG to compute the most probable parse for a user query, reflecting linguistic structure and word usage of the domain being parsed. The optimal syntactic parse for a user query thus obtained is employed for phrase recognition and expansion. Phrase recognition is used to increase retrieval precision; phrase expansion is applied to make the best use possible of very short Web queries.
Keywords :
MMIR , image indexing/retrieval , content-based indexing/retrieval
Journal title :
SIGIR FORUM
Serial Year :
1999
Journal title :
SIGIR FORUM
Record number :
16670
Link To Document :
بازگشت