DocumentCode :
745001
Title :
Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones
Author :
Li, Weifeng ; Takeda, Kazuya ; Itakura, Fumitada
Author_Institution :
Dept. of Inf. Electron., Nagoya Univ., Japan
Volume :
12
Issue :
4
fYear :
2005
fDate :
4/1/2005 12:00:00 AM
Firstpage :
340
Lastpage :
343
Abstract :
This letter addresses issues in improving hands-free speech recognition performance in different car environments. We propose a new speech-enhancement approach based on optimizing regression of the log-spectra, which is used to estimate the log-spectra of speech at a close-talking microphone by using multiple spatially distributed microphones. The regression weights can be adapted automatically for different noise environments. Compared to the nearest distant microphone and adaptive beamformer generalized sidelobe canceller (GSC), the proposed approach shows an advantage in the average relative word error rate (WER) reduction of 58.5 and 10.3%, respectively, for isolated word recognition under 15 real-car environments.
Keywords :
array signal processing; microphone arrays; multilayer perceptrons; optimisation; regression analysis; speech enhancement; speech recognition; support vector machines; WER; adaptive beamforming; adaptive log-spectral regression; car environment; close-talking microphone; hands-free speech recognition; in-car speech recognition; multilayer perceptron; multiple distributed microphone; multiple spatially distributed microphone; noise environment; optimisation; regression weight; speech-enhancement; support vector machine; word error rate; word recognition; Array signal processing; Automatic speech recognition; Filter bank; Linear regression; Microphone arrays; Multilayer perceptrons; Noise cancellation; Speech recognition; Support vector machines; Working environment noise; Adaptive beamforming; k-means; multilayer perceptron; speech recognition; support vector machine;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2005.843761
Filename :
1407935
Link To Document :
بازگشت