DocumentCode :
2155794
Title :
Spectrum-entropy based beam-former with speaker tracking for hands-free continuous speech recognition in noise
Author :
George, Nokas ; Evangelos, Dermatas
Author_Institution :
Dept. of Electr. & Comput. Eng., Patras Univ., Greece
Volume :
1
fYear :
2002
fDate :
2002
Firstpage :
251
Abstract :
In hands-free speech recognition of moving speakers, the time interval where the source position can be assumed stationary varies. It is very common for the speaker to move rapidly within the data window exploited. In such cases the conventional fixed-window direction of arrival (DOA) estimation may lead to poor tracking performance. In this paper we present a novel speech beamformer for moving speakers in noisy environments. The localization algorithm extracts a set of candidate DOA of the signal sources using array signal processing methods in the frequency domain. A minimum variance (MV) beamformer identifies the speech signal DOA in the direction where the signal´s spectrum entropy is minimized. The same localization algorithm is used to detect the closest direction to the initial estimation using a smaller window. The proposed method is evaluated using a phoneme recognition system and noise recordings from an air-condition fan and the TIMIT speech corpus. Extended experiments, carried out in the range of 25-0 dB SNR, show significant improvement in the recognition rate of moving speakers especially in very low SNR.
Keywords :
array signal processing; direction-of-arrival estimation; frequency-domain analysis; identification; minimum entropy methods; spectral analysis; speech processing; speech recognition; DOA estimation; array signal processing; direction of arrival estimation; frequency domain; hands-free continuous speech recognition; identification; localization algorithm; minimum variance beamformer; moving speakers; noise recordings; noisy environments; phoneme recognition system; speaker tracking; spectrum entropy minimization; speech signal; Array signal processing; Data mining; Direction of arrival estimation; Entropy; Frequency domain analysis; Signal processing; Signal processing algorithms; Signal to noise ratio; Speech recognition; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN :
0-7803-7503-3
Type :
conf
DOI :
10.1109/ICDSP.2002.1027881
Filename :
1027881
Link To Document :
بازگشت