Title :
Digit Recognition in the Náhuatl Language: An Evaluation Using Various Recognition Models
Author :
Sergio Suarez-Guerra;Jose Luis Oropeza-Rodriguez;Juan Carlos Flores-Paulin;Luis Pastor Sanchez-Fernandez
Abstract :
The aim of Automatic Speech Recognition (ASR) is to develop techniques and systems that enable a computer to accept speech input. The digit recognition task has been often employed contributing to the ASR. In this work, we used parameters of Lineal Prediction Codes (LPC) and Mel Frequency Cepstrum Coefficients (MFCCs). For selection of the best analysis interval we used a Vector Quantization Model. For recognition, we applied the Continuous Density Hidden Markov Model (CDHHM), which employed a dictionary conformed of eighteen command words that are specific digits from the Náhuatl language. The obtained results were compared using Discrete Hidden Markov Models and Vector Quantization Models. In this experiment, we obtained a performance of 99% accuracy for digit recognition. In our experiments we used three native speakers.
Keywords :
"Hidden Markov models","Speech recognition","Speech","Training","Viterbi algorithm","Mel frequency cepstral coefficient","Vector quantization"
Conference_Titel :
Artificial Intelligence (MICAI), 2010 Ninth Mexican International Conference on
Print_ISBN :
978-1-4244-9246-6;978-0-7695-4284-3
DOI :
10.1109/MICAI.2010.27