Title :
A Query Expansion Algorithm Based on Phrases Semantic Similarity
Author :
Liu, Yongli ; Li, Chao ; Zhang, Pin ; Xiong, Zhang
Author_Institution :
Sch. of Comput. Sci. & Technol., Beihang Univ., Beijing
Abstract :
During the indexing process of traditional search engine, web pages become a list of terms, but single term cannot represent the rich content of web pages, which makes information retrieval methods mainly based on terms matching often result in depressed precision. This paper proposes a novel query expansion technique that has phrases as its expansion unit. Phrases typically have a higher information content and a smaller degree of ambiguity than their constituent words, and therefore represent the concepts expressed in text more accurately than single terms. This method extracts key phrases from original results, and calculates the semantic similarity between the query phrase and each phrase extracted using the semantic similarity algorithm based on WordNet, and then expands the query with the most similar phrases to search again. Experimental results show that the proposed algorithm can provide more precision than the traditional query expansion methods.
Keywords :
Internet; indexing; query processing; Web pages; WordNet; indexing process; information retrieval methods; phrases semantic similarity; query expansion algorithm; query phrase; Chaos; Computer science; Data mining; Indexing; Information processing; Information retrieval; Q measurement; Search engines; Web pages; Web search; phrase; query expansion; similarity;
Conference_Titel :
Information Processing (ISIP), 2008 International Symposiums on
Conference_Location :
Moscow
Print_ISBN :
978-0-7695-3151-9
DOI :
10.1109/ISIP.2008.57