Title :
A binaural speech processing method using subband-cross correlation analysis for noise robust recognition
Author :
Kajita, Shoji ; Takeda, Kazuya ; Itakura, Fumitada
Author_Institution :
Graduate Sch. of Eng., Nagoya Univ., Japan
Abstract :
This paper describes an extended subband-cross-correlation (SBXCOR) analysis to improve the robustness against noise. The SBXCOR analysis, which has been already proposed, is a binaural speech processing technique using two input signals and extracts the periodicities associated with the inverse of the center frequency (CF) in each subband. In this paper, by taking an exponentially weighted sum of crosscorrelation at the integral multiples of the inverse of CF, SBXCOR is extended so as to capture more periodicities included in two input signals. The experimental results using a DTW word recognizer showed that the processing improves the performance of SBXCOR for both that of the white noise and a computer room noise. For white noise, the extended SBXCOR performed significantly better than the smoothed group delay spectrum and the mel-frequency cepstral coefficient (MFCC) extracted from both monaural and binaural signals. However, for the computer room noise, it outperformed only at SNR 0 dB
Keywords :
acoustic correlation; acoustic noise; inverse problems; speech processing; speech recognition; white noise; DTW word recognizer; SBXCOR analysis; binaural speech processing method; center frequency; computer room noise; input signals; inverse; noise robust recognition; periodicities; subband-cross correlation analysis; white noise; Cepstral analysis; Delay; Mel frequency cepstral coefficient; Noise robustness; Signal analysis; Signal processing; Signal to noise ratio; Speech analysis; Speech processing; White noise;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596170