Title :
Comparative experiments on task classification for spoken language understanding using Naive Bayes classifier
Author :
Weilin Wu ; Ruzhan Lu ; Liu, Zheng
Author_Institution :
Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
Abstract :
We present a series of comparative experiments on using statistical classifiers for task classification. Our experiments focus on three aspects: the effect of different features on the performance of the classifier, the robustness of classifiers with different features on data variability and the effect of size of training data on the performance of the classifier. For Chinese input sentences, three linguistics units can be used as the features: Chinese characters, Chinese words and semantic constituents. Both advantages and disadvantages of them are analyzed in details. A controlled study using Naive Bayes classifiers is conducted to examine the impact of different features on the performance of classifiers. The classifiers with different features are evaluated respectively on the clean and noisy test data to investigate their robustness. Learning curves of the classifiers with different features are given to show the effect of size of training data.
Keywords :
Bayes methods; computational linguistics; linguistics; natural languages; pattern classification; speech processing; Chinese characters; Chinese words; Naive Bayes classifier; learning curves; semantic constituents; spoken language understanding; statistical classifiers; support vector machines; task classification; Classification algorithms; Computer science; Data mining; Decision trees; Natural languages; Robustness; Routing; Testing; Training data; Transportation;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275955