Title :
Semantic Chunk Annotation for questions using Maximum Entropy
Author :
Fan, Shixi ; Zhang, Yaoyun ; Ng, Wing W Y ; Wang, Xuan ; Wang, Xiaolong
Author_Institution :
Shenzhen Grad. Sch., Harbin Inst. of Technol., Shenzhen
Abstract :
We present a Maximum Entropy (ME) model for Semantic Chunk Annotation in a Chinese Question and Answer (Q&A) system. The model is derived from a corpus of real-world questions collected from Internet discussion groups. Because these questions were written to be answered by other people, they are very complex. We introduce the semantic chunks, describe the features used by the model, and adopt mutual information (MI) for feature selection. The training data consists of 14,000 sentences and the test data of 4,000 sentences. The model achieves an F-score of 90.68%.
Keywords :
maximum entropy methods; query processing; search engines; semantic Web; Chinese Question and Answer system; Internet; Semantic Chunk Annotation; feature selection; maximum entropy model; mutual information; Computer architecture; Computer science; Databases; Entropy; Information retrieval; Natural languages; Testing; Maximum Entropy; Q&A
Conference_Title :
Systems, Man and Cybernetics, 2008. SMC 2008. IEEE International Conference on
Conference_Location :
Singapore
Print_ISBN :
978-1-4244-2383-5
Electronic_ISBN :
1062-922X
DOI :
10.1109/ICSMC.2008.4811317