Title :
Appropriate Farsi speech recognizer for commanding robots: (Performance evaluation of correlation-based and model-based classifiers for a Farsi isolated word recognition robotic system)
Author :
Rashedi, Ashkan ; Moghaddam, Shahriar Shirvani
Author_Institution :
Dept. of Electr. & Comput. Eng., Shahid Rajaee Teacher Training Univ. (SRTTU), Tehran, Iran
Abstract :
In this research, two different classifier categories, correlation-based and neural network-based, are investigated for a Farsi isolated word recognizer commanding robotic system. Correlation-based category is divided to time and frequency domains. Moreover, in each of them, three decision making methods, Max, Average, and 10-Max are proposed. In addition, in neural network-based category, LPC is considered to extract the features. At first, separated samples of 4 Farsi pronounced commands (Left, Right, Forward, and Backward) go through a pre-processing section. Three methods of correlation-based category are used independently with the same data base and their performances are evaluated word by word as well as in total case. Finally the results of above mentioned methods are compared. On the other hand, LPC features get extracted independently from output of preprocessing section, and are used as inputs of the N.N. In this way one result associated to N.N.-based method is produced. Simulation results show that frequency-domain correlation-based method introduces the best recognition but, it is close to the LPC-based N.N. system. Finally it is preferred to use LPC due to lower processing time with 87.5% recognition.
Keywords :
correlation methods; frequency-domain analysis; neural nets; performance evaluation; robots; speech recognition; time-domain analysis; Farsi isolated word recognition robotic system; Farsi isolated word recognizer commanding robotic system; Farsi pronounced commands; Farsi speech recognizer; LPC; classifier category; commanding robots; correlation-based category; correlation-based classifiers; decision making methods; frequency domain; frequency-domain correlation-based method; model-based classifiers; neural network-based category; performance evaluation; time domain; Correlation-based; LPC; Neural network MLP; Speech Recognizer;
Conference_Titel :
Signal Processing (ICSP), 2010 IEEE 10th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-5897-4
DOI :
10.1109/ICOSP.2010.5656051