Title :
Multiple regression using support vector machines for recognition of speech in a moving car environment
Author :
Lee, Weifeng ; Sekhar, C. Chandra ; Takeda, Kazuya ; Itakura, Fumitada
Author_Institution :
Dept. of Inf. Electron., Nagoya Univ., Japan
Abstract :
In a moving car environment, speech data is collected using a close-talking microphone placed in the headset of driver and multiple distant microphones placed around the driver. We address the issues in estimating spectral features of speech data collected using the close-talking microphone from the spectral features of data recorded on the distant microphones. We study methods such as concatenation, averaging, linear regression and nonlinear regression for estimation. We consider support vector machines (SVMs) for nonlinear regression of multiple spectral coefficients. We compare the performance of SVMs and hidden Markov models (HMMs) in recognition of subword units of speech using the original spectral features and the estimated spectral features. A Japanese speech corpus consisting of recordings in a moving car environment is used for our studies on estimation of spectral features and recognition of subword units of speech. Results of our studies show that SVM based regression performs better compared to linear regression, and SVMs give a higher recognition accuracy compared to HMMs.
Keywords :
automobiles; hidden Markov models; natural languages; regression analysis; spectral analysis; speech recognition; support vector machines; HMMs; Japanese speech corpus; SVM based regression; close-talking microphone; hidden Markov models; linear regression; moving car environment; multiple distant microphones; multiple regression; multiple spectral coefficients; nonlinear regression; spectral features; speech data; speech recognition; subword units; support vector machines; Driver circuits; Hidden Markov models; Linear regression; Microphone arrays; Signal processing; Speech enhancement; Speech recognition; Support vector machines; Vehicles; Working environment noise;
Conference_Titel :
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN :
981-04-7524-1
DOI :
10.1109/ICONIP.2002.1198192