DocumentCode :
1793657
Title :
Query subtopic mining for search result diversification
Author :
Ullah, Md Zia ; Aono, Masaki
Author_Institution :
Dept. of Comput. Sci. & Eng., Toyohashi Univ. of Technol., Toyohashi, Japan
fYear :
2014
fDate :
20-21 Aug. 2014
Firstpage :
309
Lastpage :
314
Abstract :
Web search queries are usually short, ambiguous, and contain multiple aspects or subtopics. Different users may have different search intents (or information needs) when submitting the same query. The task of identifying the subtopics underlying a query has received much attention in recent years. In this paper, we propose a method that exploits query reformulations provided by three major Web search engines (WSEs) as a means to uncover different query subtopics. In this regard, we estimate the importance of the subtopics by introducing multiple query-dependent and query-independent features, and rank the subtopics by balancing relevancy and novelty. Our experiment with the NTCIR-10 INTENT-2 English Subtopic Mining test collection shows that our method outperforms all participants´ methods in NTCIR-10 INTENT-2 task in terms of D#-nDCG@10.
Keywords :
data mining; query formulation; search engines; D#-nDCG@10; NTCIR-10 INTENT-2 english subtopic mining test collection; WSE; Web search engines; Web search queries; information needs; multiple query-dependent features; query reformulations; query subtopic mining; query subtopics; query-independent features; search intents; search result diversification; Computational modeling; Educational institutions; Feature extraction; Search engines; Vectors; Web search; diversification; intent; subtopic mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Informatics: Concept, Theory and Application (ICAICTA), 2014 International Conference of
Conference_Location :
Bandung
Print_ISBN :
978-1-4799-6984-5
Type :
conf
DOI :
10.1109/ICAICTA.2014.7005960
Filename :
7005960
Link To Document :
بازگشت