DocumentCode :
394170
Title :
Multiple regression using support vector machines for recognition of speech in a moving car environment
Author :
Lee, Weifeng ; Sekhar, C. Chandra ; Takeda, Kazuya ; Itakura, Fumitada
Author_Institution :
Dept. of Inf. Electron., Nagoya Univ., Japan
Volume :
2
fYear :
2002
fDate :
18-22 Nov. 2002
Firstpage :
904
Abstract :
In a moving car environment, speech data is collected using a close-talking microphone placed in the headset of the driver and multiple distant microphones placed around the driver. We address the issues in estimating the spectral features of speech data collected using the close-talking microphone from the spectral features of data recorded on the distant microphones. We study methods such as concatenation, averaging, linear regression and nonlinear regression for estimation. We consider support vector machines (SVMs) for nonlinear regression of multiple spectral coefficients. We compare the performance of SVMs and hidden Markov models (HMMs) in recognition of subword units of speech using the original spectral features and the estimated spectral features. A Japanese speech corpus consisting of recordings in a moving car environment is used for our studies on estimation of spectral features and recognition of subword units of speech. Results of our studies show that SVM-based regression performs better than linear regression, and that SVMs give higher recognition accuracy than HMMs.
Keywords :
automobiles; hidden Markov models; natural languages; regression analysis; spectral analysis; speech recognition; support vector machines; HMMs; Japanese speech corpus; SVM based regression; close-talking microphone; hidden Markov models; linear regression; moving car environment; multiple distant microphones; multiple regression; multiple spectral coefficients; nonlinear regression; spectral features; speech data; speech recognition; subword units; support vector machines; Driver circuits; Hidden Markov models; Linear regression; Microphone arrays; Signal processing; Speech enhancement; Speech recognition; Support vector machines; Vehicles; Working environment noise;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN :
981-04-7524-1
Type :
conf
DOI :
10.1109/ICONIP.2002.1198192
Filename :
1198192