DocumentCode :
3229468
Title :
Chinese Question Classification Based on Ensemble Learning
Author :
Jia, Keliang ; Chen, Kang ; Fan, Xiaozhong ; Zhang, Yu
Author_Institution :
Beijing Inst. of Technol., Beijing
Volume :
3
fYear :
2007
fDate :
July 30 2007-Aug. 1 2007
Firstpage :
342
Lastpage :
347
Abstract :
In this paper, a method of Chinese question classification based on ensemble learning is proposed. Words and bi-gram are extracted from question as classic features and feature classifiers are constructed. The classifiers are kinds of simple classifier and their generalization ability are not strong enough, but they are a qualified base learner for ensemble because of their low computational cost, and the generalization feasibility improved with the help of ensemble learning. We translate and modify the UIUC question set and TREC2001 question set as Chinese question set. Experimental results on the corpus show that the proposed method can achieve good performance, the classification precision reaches 87.6%, under the fine grained question types.
Keywords :
character recognition; character sets; feature extraction; learning (artificial intelligence); pattern classification; Chinese question classification; Chinese question set; TREC2001 question set; UIUC question set; bigram; ensemble learning; feature classifiers; feature extraction; words; Artificial intelligence; Computer science; Distributed computing; Humans; Learning; Snow; Software engineering; Support vector machine classification; Support vector machines; Taxonomy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, 2007. SNPD 2007. Eighth ACIS International Conference on
Conference_Location :
Qingdao
Print_ISBN :
978-0-7695-2909-7
Type :
conf
DOI :
10.1109/SNPD.2007.183
Filename :
4287875
Link To Document :
بازگشت