DocumentCode :
511253
Title :
Chinese Question Classification Based on Semantic Gram and SVM
Author :
Wang, Liang ; Zhang, Hui ; Wang, Deqing ; Huang, Jia
Author_Institution :
State Key Lab. of Software Dev. Environ., Beihang Univ., Beijing, China
Volume :
1
fYear :
2009
fDate :
25-27 Dec. 2009
Firstpage :
432
Lastpage :
435
Abstract :
Question classification plays a crucial important role in the question answering system. Recent research on question classification for open-domain mostly concentrates on using machine learning methods to resolve the special kind of text classification. This paper presents our research about Chinese question classification using machine learning method and gives our approach based on SVM and semantic gram extraction. SVM has been widely used for question classification and got good performances. We use SVM as the classifier and propose a new feature extraction method of Chinese questions which is called semantic gram extraction. The method is proposed based on the word semantics and N-gram. The experiment results show that the feature extraction can perform well with SVM and our approach can reach high classification accuracy.
Keywords :
classification; feature extraction; information retrieval; learning (artificial intelligence); support vector machines; text analysis; Chinese question classification; SVM; feature extraction; machine learning; question answering system; semantic gram extraction; text classification; Application software; Computer applications; Data mining; Feature extraction; Learning systems; Programming; Support vector machine classification; Support vector machines; Testing; Text categorization; Chinese question classification; SVM; feature extraction; semantic gram;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science-Technology and Applications, 2009. IFCSTA '09. International Forum on
Conference_Location :
Chongqing
Print_ISBN :
978-0-7695-3930-0
Electronic_ISBN :
978-1-4244-5423-5
Type :
conf
DOI :
10.1109/IFCSTA.2009.111
Filename :
5385040
Link To Document :
بازگشت