Title :
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition
Author :
Nishiura, Takanobu ; Nakayama, Masato ; Nakamura, Satoshi
Author_Institution :
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Abstract :
Distant-talking speech recognition in noisy environments is indispensable for self-moving robots or tele-conference systems. However, background noise and room reverberations seriously degrade the sound-capture quality in real acoustic environments. A microphone array is an ideal candidate as an effective method for capturing distant-talking speech. AMNOR (adaptive microphone-array for noise reduction) was proposed as an adaptive beamformer for capturing the desired distant signals in noisy environments by Kaneda et al. Although the AMNOR has been proven effective, it can be further improved if we know spectrum characteristics of the desired distant signals in advance. Therefore, we regarded speech as a desired distant signal and designed an AMNOR based on the average speech spectrum. In this paper, we particularly focused on the performance of AMNOR based on the average speech spectrum for distant-talking speech capture and recognition. As a result of evaluation experiments in real acoustic environments, we confirmed that the ASR (automatic speech recognition) performance was improved 5-10% by using AMNOR based on the average speech spectrum in noisy environments. In addition, the proposed AMNOR provides better noise reduction performance than that of conventional AMNOR.
Keywords :
adaptive signal processing; array signal processing; interference suppression; microphones; speech processing; speech recognition; adaptive beamformer; adaptive microphone-array for noise reduction; automatic speech recognition; average speech spectrum; distant-talking speech recognition; noisy speech recognition; self-moving robots; sound-capture quality; teleconference systems; Acoustic noise; Automatic speech recognition; Background noise; Microphone arrays; Noise reduction; Robots; Speech analysis; Speech recognition; Teleconferencing; Working environment noise;
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221285