DocumentCode
2665451
Title
Comparative experiments on task classification for spoken language understanding using Naive Bayes classifier
Author
Weilin Wu ; Ruzhan Lu ; Liu, Zheng
Author_Institution
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
fYear
2003
fDate
26-29 Oct. 2003
Firstpage
492
Lastpage
497
Abstract
We present a series of comparative experiments on using statistical classifiers for task classification. Our experiments focus on three aspects: the effect of different features on the performance of the classifier, the robustness of classifiers with different features on data variability and the effect of size of training data on the performance of the classifier. For Chinese input sentences, three linguistics units can be used as the features: Chinese characters, Chinese words and semantic constituents. Both advantages and disadvantages of them are analyzed in details. A controlled study using Naive Bayes classifiers is conducted to examine the impact of different features on the performance of classifiers. The classifiers with different features are evaluated respectively on the clean and noisy test data to investigate their robustness. Learning curves of the classifiers with different features are given to show the effect of size of training data.
Keywords
Bayes methods; computational linguistics; linguistics; natural languages; pattern classification; speech processing; Chinese characters; Chinese words; Naive Bayes classifier; learning curves; semantic constituents; spoken language understanding; statistical classifiers; support vector machines; task classification; Classification algorithms; Computer science; Data mining; Decision trees; Natural languages; Robustness; Routing; Testing; Training data; Transportation;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location
Beijing, China
Print_ISBN
0-7803-7902-0
Type
conf
DOI
10.1109/NLPKE.2003.1275955
Filename
1275955
Link To Document