• DocumentCode
    2665451
  • Title

    Comparative experiments on task classification for spoken language understanding using Naive Bayes classifier

  • Author

    Weilin Wu ; Ruzhan Lu ; Liu, Zheng

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Shanghai Jiao Tong Univ., China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    492
  • Lastpage
    497
  • Abstract
    We present a series of comparative experiments on using statistical classifiers for task classification. Our experiments focus on three aspects: the effect of different features on the performance of the classifier, the robustness of classifiers with different features on data variability and the effect of size of training data on the performance of the classifier. For Chinese input sentences, three linguistics units can be used as the features: Chinese characters, Chinese words and semantic constituents. Both advantages and disadvantages of them are analyzed in details. A controlled study using Naive Bayes classifiers is conducted to examine the impact of different features on the performance of classifiers. The classifiers with different features are evaluated respectively on the clean and noisy test data to investigate their robustness. Learning curves of the classifiers with different features are given to show the effect of size of training data.
  • Keywords
    Bayes methods; computational linguistics; linguistics; natural languages; pattern classification; speech processing; Chinese characters; Chinese words; Naive Bayes classifier; learning curves; semantic constituents; spoken language understanding; statistical classifiers; support vector machines; task classification; Classification algorithms; Computer science; Data mining; Decision trees; Natural languages; Robustness; Routing; Testing; Training data; Transportation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275955
  • Filename
    1275955