Title :
Predicting Query Performance in Domain-Specific Corpora
Author :
Sarnikar, Surendra ; Zhang, Zhu ; Zhao, J. Leon
Author_Institution :
Dept. of Manage. Inf. Syst., Arizona Univ., Tucson, AZ
Abstract :
The performance of a document recommender system is dependent on the quality and characteristics of the query used by the recommender to retrieve relevant documents. Automatically predicting the performance of a query can help identify ineffective queries and can help improve performance by selectively applying query expansion techniques. In this paper, we study information-entropy-based measures for predicting performance of a query in the context of domain-specific corpora. We propose a new sampling mechanism that can more accurately estimate query models in domain-specific corpora and hence deliver better predictions. We evaluate the validity our technique by analyzing its performance in five different domain-specific corpora
Keywords :
query processing; document retrieval; domain-specific corpora; information-entropy-based measures; query performance prediction; sampling mechanism; Content based retrieval; Educational institutions; Frequency; Humans; Information retrieval; Machine learning; Performance analysis; Predictive models; Recommender systems; Search engines;
Conference_Titel :
System Sciences, 2007. HICSS 2007. 40th Annual Hawaii International Conference on
Conference_Location :
Waikoloa, HI
Electronic_ISBN :
1530-1605
DOI :
10.1109/HICSS.2007.440