• DocumentCode
    3406832
  • Title

    Appropriate Farsi speech recognizer for commanding robots: (Performance evaluation of correlation-based and model-based classifiers for a Farsi isolated word recognition robotic system)

  • Author

    Rashedi, Ashkan ; Moghaddam, Shahriar Shirvani

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Shahid Rajaee Teacher Training Univ. (SRTTU), Tehran, Iran
  • fYear
    2010
  • fDate
    24-28 Oct. 2010
  • Firstpage
    573
  • Lastpage
    576
  • Abstract
    In this research, two classifier categories, correlation-based and neural network-based, are investigated for a Farsi isolated-word recognizer that commands a robotic system. The correlation-based category is divided into time-domain and frequency-domain approaches, and for each of them three decision-making methods (Max, Average, and 10-Max) are proposed. In the neural network-based category, LPC is used to extract the features. First, separate samples of four spoken Farsi commands (Left, Right, Forward, and Backward) pass through a preprocessing stage. The three correlation-based methods are then applied independently to the same database, their performance is evaluated word by word as well as overall, and the results are compared. Separately, LPC features are extracted from the output of the preprocessing stage and used as inputs to the neural network (N.N.), which yields a single result for the N.N.-based method. Simulation results show that the frequency-domain correlation-based method gives the best recognition, but its performance is close to that of the LPC-based N.N. system. The LPC-based system is ultimately preferred because of its lower processing time, with 87.5% recognition.
  • Keywords
    correlation methods; frequency-domain analysis; neural nets; performance evaluation; robots; speech recognition; time-domain analysis; Farsi isolated word recognition robotic system; Farsi isolated word recognizer commanding robotic system; Farsi pronounced commands; Farsi speech recognizer; LPC; classifier category; commanding robots; correlation-based category; correlation-based classifiers; decision making methods; frequency domain; frequency-domain correlation-based method; model-based classifiers; neural network-based category; performance evaluation; time domain; Correlation-based; LPC; Neural network MLP; Speech Recognizer;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing (ICSP), 2010 IEEE 10th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    978-1-4244-5897-4
  • Type

    conf

  • DOI
    10.1109/ICOSP.2010.5656051
  • Filename
    5656051