DocumentCode :
394273
Title :
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments
Author :
Seltzer, Michael L. ; Stern, Richard M.
Author_Institution :
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
We present a new subband microphone array processing algorithm specifically designed for speech recognition applications. We previously proposed a speech recognizer-based array processing algorithm which resulted in significant improvements in recognition accuracy when the speech was corrupted by additive noise and moderate levels of reverberation. However, little improvement was achieved over conventional beamforming methods in highly reverberant environments. Subband processing has been used to improve the poor performance of LMS-type algorithms when the number of filter parameters to estimate is large and the noise is highly correlated to the speech signal, e.g. in highly reverberant environments. We apply a subband approach to a new array processing architecture in which select groups of subbands are processed jointly to maximize the likelihood of the resulting speech recognition features, as measured by the recognition system itself. By incorporating the recognizer into the filter optimization scheme we ensure that signal components important for recognition are emphasized without undue emphasis on less critical components. By utilizing a subband approach, we can effectively apply this framework to highly reverberant environments. In doing so, we are able to achieve improvements in word error rate of over 20% compared to conventional methods in highly reverberant environments.
Keywords :
acoustic transducer arrays; array signal processing; filtering theory; microphones; optimisation; reverberation; speech processing; speech recognition; LMS-type algorithms; additive noise; array processing architecture; beamforming methods; filter optimization; filter parameters; log mel spectrum subband filtering; recognition accuracy; reverberant environments; reverberation; signal components; speech recognition applications; speech signal; subband microphone array processing algorithm; subband parameter optimization; word error rate; Additive noise; Algorithm design and analysis; Array signal processing; Filters; Microphone arrays; Process design; Reverberation; Speech enhancement; Speech processing; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198804
Filename :
1198804
Link To Document :
بازگشت