Title :
Diversification of web search results using post-retrieval clustering
Author :
Kumar, Sudhakar ; Jain, S.K. ; Sharma, R.M.
Author_Institution :
Dept. of Comput. Eng., Nat. Inst. of Technol., Kurukshetra, India
Abstract :
Diversification of results in web search engines is a very attractive area for researchers now a days. Information retrieval techniques mainly focus on the relevance of the documents retrieved but these techniques often fail to satisfy each user. In this work, we present a coverage based diversification using post retrieval clustering. We model clusters corresponding to the query based on the features of the web pages such as web pages of similar features are to be in one cluster and web pages from dissimilar features are to be in different clusters. A query can retrieve relevant and diverse result set if all the results cover as many clusters as possible.
Keywords :
information retrieval; search engines; Web pages; Web search diversification; Web search engine; coverage based diversification; information retrieval; post-retrieval clustering; Algorithm design and analysis; Clustering algorithms; Feature extraction; Search engines; Vectors; Web pages; Web search; Clustering; Cosine Similarity; Relevance; Result diversification;
Conference_Titel :
Computer and Communication Technology (ICCCT), 2014 International Conference on
Conference_Location :
Allahabad
Print_ISBN :
978-1-4799-6757-5
DOI :
10.1109/ICCCT.2014.7001460