Title :
Query subtopic mining for search result diversification
Author :
Ullah, Md Zia ; Aono, Masaki
Author_Institution :
Dept. of Comput. Sci. & Eng., Toyohashi Univ. of Technol., Toyohashi, Japan
Abstract :
Web search queries are usually short, ambiguous, and contain multiple aspects or subtopics. Different users may have different search intents (or information needs) when submitting the same query. The task of identifying the subtopics underlying a query has received much attention in recent years. In this paper, we propose a method that exploits query reformulations provided by three major Web search engines (WSEs) as a means to uncover different query subtopics. In this regard, we estimate the importance of the subtopics by introducing multiple query-dependent and query-independent features, and rank the subtopics by balancing relevancy and novelty. Our experiment with the NTCIR-10 INTENT-2 English Subtopic Mining test collection shows that our method outperforms all participants´ methods in NTCIR-10 INTENT-2 task in terms of D#-nDCG@10.
Keywords :
data mining; query formulation; search engines; D#-nDCG@10; NTCIR-10 INTENT-2 english subtopic mining test collection; WSE; Web search engines; Web search queries; information needs; multiple query-dependent features; query reformulations; query subtopic mining; query subtopics; query-independent features; search intents; search result diversification; Computational modeling; Educational institutions; Feature extraction; Search engines; Vectors; Web search; diversification; intent; subtopic mining;
Conference_Titel :
Advanced Informatics: Concept, Theory and Application (ICAICTA), 2014 International Conference of
Conference_Location :
Bandung
Print_ISBN :
978-1-4799-6984-5
DOI :
10.1109/ICAICTA.2014.7005960