Title :
Unilateral semi-supervised learning of extended hidden vector state for Persian Language Understanding
Author :
Jabbari, Fattaneh ; Sameti, Hossein ; Bokaei, Mohammad Hadi
Author_Institution :
Dept. of Comput. Eng., Sharif Univ. of Technol., Tehran, Iran
Abstract :
The key element of a spoken dialogue system is Spoken Language Understanding (SLU) part. HVS and EHVS are two most popular statistical methods employed to implement the SLU part which need lightly annotated data. Since annotation is a time consuming, we present a novel semi-supervised learning for EHVS to reduce the human labeling effort using two different statistical classifiers, SVM and KNN. Experiments are done on a Persian corpus, the University Information Kiosk corpus. The experimental results show improvements in performance of semi-supervised EHVS, trained by both labeled and unlabeled data, compared to EHVS trained by just initially labeled data. The performance of EHVS improves 13.41% in the case of SVM classifier and 5.16% in the case of KNN. This demonstrates effectiveness and feasibility of the proposed approach.
Keywords :
interactive systems; learning (artificial intelligence); natural language processing; pattern classification; statistical analysis; support vector machines; EHVS; KNN; Persian corpus; Persian language understanding; SVM classifier; University Information Kiosk corpus; extended hidden vector state; spoken dialogue system; spoken language understanding; statistical classifiers; statistical methods; unilateral semisupervised learning; Accuracy; Hidden Markov models; Labeling; Semantics; Support vector machines; Training; Vectors; extended hidden vector state; semi-supervised learning; spoken language understanding; statistical classifier;
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
DOI :
10.1109/NLPKE.2011.6138187