DocumentCode :
3393089
Title :
A Term Cluster Query Expansion Model Based on Classification Information in Natural Language Information Retrieval
Author :
Kang, Jeon Wook ; Kang, Hyun-Kyu ; Ko, Myeong-Cheol ; Jeon, Heung Seok ; Nam, Junghyun
Author_Institution :
Hankuk Acad. of Foreign Studies, Yongin, South Korea
Volume :
2
fYear :
2010
fDate :
23-24 Oct. 2010
Firstpage :
172
Lastpage :
176
Abstract :
A natural language information retrieval system ranks related documents according to criteria based on user query keywords and document similarities. However, many efforts have been made to make more useful query keywords because users do not use many keywords in their natural language search query when retrieving information on the Web. Because a keyword does not provide much information, however, relevance feedback is generally used to complement the weakness of general retrieval methods. This paper proposes a term cluster query expansion model based on classification information of retrieved documents. This model generates classification information from the upper ranked n documents retrieved by retrieval system. On the basis of the extracted classification information, the term cluster (m) that represents each group is generated, and then the model allows user to select term cluster that corresponds to user information needs. The query keywords are expanded by using a relevance feedback algorithm based on the selected classification information. As a result of the experiments with test collection, the retrieval effectiveness was improved by 13.2% compared to the initial query when the Rocchio method was used.
Keywords :
information needs; natural language processing; pattern classification; query processing; relevance feedback; Rocchio method; classification information; document similarities; natural language information retrieval system; natural language search query; relevance feedback algorithm; term cluster query expansion model; user information needs; user query keywords; Classification algorithms; Clustering algorithms; Geography; History; Industries; Information retrieval; Natural languages; Classification Information; Natural Language Information Retrieval; Query Expansion; Relevance Feedback; Term Cluster;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence and Computational Intelligence (AICI), 2010 International Conference on
Conference_Location :
Sanya
Print_ISBN :
978-1-4244-8432-4
Type :
conf
DOI :
10.1109/AICI.2010.159
Filename :
5655201
Link To Document :
بازگشت