DocumentCode :
951083
Title :
Semantic segment extraction and matching for Internet FAQ retrieval
Author :
Wu, Chung-Hsien ; Yeh, Jui-Feng ; Lai, Yu-Sheng
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume :
18
Issue :
7
fYear :
2006
fDate :
7/1/2006 12:00:00 AM
Firstpage :
930
Lastpage :
940
Abstract :
This investigation presents a novel approach to semantic segment extraction and matching for retrieving information from Internet FAQs with natural language queries. Two semantic segments, the question category segment (QS) and the keyword segment (KS), are extracted from the input queries and the FAQ questions with a semiautomatically derived question-semantic grammar. A semantic matching method is presented to estimate the similarity between the semantic segments of the query and the questions in the FAQ collection. Additionally, the vector space model (VSM) is adopted to measure the similarity between the query and the answers of the QA pairs. Finally, a multistage ranking strategy is adopted to determine the optimally performing combination of similarity metrics. The experimental results illustrate that the proposed method achieves an average rank of 4.52 and a top-10 recall rate of 90.89 percent. Compared with the query-expansion method, this method improves the performance by 4.82 places in the average rank of correct answers, 25.34 percent in the top-5 recall rate, and 5.21 percent in the top-10 recall rate.
Keywords :
Internet; information retrieval; natural languages; semantic networks; FAQ retrieval; Internet; keyword segment; multistage ranking strategy; natural language queries; query-expansion method; question category segment; question-semantic grammar; semantic segment extraction; semantic segment matching; vector space model; Data mining; Explosives; Filtering; Helium; Information retrieval; Internet; Natural languages; Navigation; Ontologies; Search engines; Natural language processing; deduction and theorem proving; knowledge processing.; query formulation; retrieval models;
fLanguage :
English
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
Publisher :
ieee
ISSN :
1041-4347
Type :
jour
DOI :
10.1109/TKDE.2006.115
Filename :
1637419
Link To Document :
بازگشت