DocumentCode :
3453698
Title :
Speech recognition from adaptive windowing PSD estimation
Author :
Ravan, M. ; Beheshti, Soosan
Author_Institution :
Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada
fYear :
2011
fDate :
8-11 May 2011
Abstract :
Speech-recognition technology is embedded in voice activated routing systems at customer call centers, voice dialing on mobile phones, and many other everyday applications. Consequently, designing a robust speech recognition system that adapts to acoustic conditions, such as the speaker\´s speech rate and accent is of utmost interest. In this paper we present a machine learning approach for speech recognition using the k Nearest Neighbor (k-NN) classifier. A small size vocabulary containing the two words "yes" and "no" is chosen that can be used for personal emergency response systems. In this method first the power spectrum density (PSD) of each frame of speech signal is estimated by using the recently developed adaptive windowing PSD estimation technique. The most relevant features corresponding to the PSD of the frame sequence are then identified using a feature selection scheme. These features are then fed into the A-NN classifier for speech recognition. The performance of the proposed method has been found to exceed 90% accuracy.
Keywords :
estimation theory; learning (artificial intelligence); mobile handsets; speech recognition; telecommunication network routing; vocabulary; acoustic conditions; adaptive windowing PSD estimation; customer call centers; feature selection; frame sequence; k nearest neighbor; k-NN classifier; machine learning; mobile phones; personal emergency response; power spectrum density; small size vocabulary; speaker speech accent; speaker speech rate; speech recognition; voice activated routing systems; voice dialing; Correlation; Estimation; Mean square error methods; Mel frequency cepstral coefficient; Speech; Speech recognition; Training; PSD estimation; Speech recognition; feature selection; k-NN classifier;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical and Computer Engineering (CCECE), 2011 24th Canadian Conference on
Conference_Location :
Niagara Falls, ON
ISSN :
0840-7789
Print_ISBN :
978-1-4244-9788-1
Electronic_ISBN :
0840-7789
Type :
conf
DOI :
10.1109/CCECE.2011.6030506
Filename :
6030506
Link To Document :
بازگشت