Title of article :
Speech Acts Classification of Persian Language Texts Using Three Machine Learning Methods
Author/Authors :
Homayounpour ، Mohammad Mehdi نويسنده Laboratory for Intelligent Sound &Speech Processing , , Soltani Panah، Arezou نويسنده Lab. for Intelligent Signal and Speech Proc. Department of Computer Engineering and IT ,
Issue Information :
فصلنامه با شماره پیاپی 5 سال 2010
Pages :
7
From page :
65
To page :
71
Abstract :
The objective of this paper is to design a system to classify Persian speech acts. The driving vision for this work is to provide intelligent systems such as text to speech, machine translation, text summarization, etc. that are sensitive to the speech acts of the input texts and can pronounce the corresponding intonation correctly. Seven speech acts were considered and 3 classification methods including (1) Naive Bayes, (2) K-Nearest Neighbors (KNN), and (3) Tree learner were used. The performance of speech act classification was evaluated using these methods including 10- Fold Cross-Validation, 70-30 Random Sampling and Area under ROC. KNN with an accuracy of 72% was shown to be the best classifier for the classification of Persian speech acts. It was observed that the amount of labeled training data had an important role in the classification performance.
Journal title :
International Journal of Information and Communication Technology Research
Serial Year :
2010
Journal title :
International Journal of Information and Communication Technology Research
Record number :
690543
Link To Document :
بازگشت