Title :
Optimized joint noise suppression and dereverberation based on blind signal extraction for hands-free speech recognition system
Author :
Aprilyanti, Fine D. ; Saruwatari, Hiroshi ; Nakamura, Shigenari ; Takatani, Tomoya
Author_Institution :
Nara Inst. of Sci. & Technol., Ikoma, Japan
Abstract :
In this paper, we build a method based on frequency-domain blind signal extraction (FD-BSE) to jointly suppress diffuse background noise and late reverberation for a hands-free automatic speech recognition (ASR) system. The proposed method utilizes FD-BSE to extract speech signal from the noisy mixture, and a multichannel postprocessing filter, such as a Wiener filter or generalized minimum mean-square error short-time spectral amplitude estimator, for dereverberation. We also apply parameter optimization that maximize the likelihood of the acoustic model of ASR. Experimental results confirm that the proposed method can improve the word recognition accuracy by up to approximately 17% compared with FD-BSE alone under a signal-to-noise ratio of 10 dB.
Keywords :
feature extraction; mean square error methods; optimisation; reverberation; speech recognition; ASR system; FD-BSE; Wiener filter; blind signal extraction; frequency domain blind signal extraction; hands free automatic speech recognition; hands free speech recognition system; mean-square error; multichannel postprocessing filter; noisy mixture; optimized joint noise dereverberation; optimized joint noise suppression; parameter optimization; speech signal extraction; Joints; Microphones; Noise measurement; Reverberation; Speech; Speech recognition; FD-BSE; Wiener filter; dereverberation; generalized MMSE-STSA; noise suppression;
Conference_Titel :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
Conference_Location :
Villers-les-Nancy
DOI :
10.1109/HSCMA.2014.6843276