Title :
Joint estimation of noise and channel distortion in a generalized EM framework
Author :
Krisjansson, T. ; Frey, B. ; Deng, L. ; Acero, A.
Author_Institution :
Waterloo Univ., Ont., Canada
Abstract :
The performance of speech cleaning and noise adaptation algorithms is heavily dependent on the quality of the noise and channel models. Various strategies have been proposed in the literature for adapting to the current noise and channel conditions. We describe the joint learning of noise and channel distortion in a novel framework called ALGONQUIN. The learning algorithm employs a generalized EM strategy wherein the E step is approximate. We discuss the characteristics of the new algorithm, with a focus on convergence rates and parameter initialization. We show that the learning algorithm can successfully disentangle the non-linear effects of noise and linear effects of the channel and achieve a relative reduction in WER of 21.8% over the non-adaptive algorithm.
Keywords :
acoustic distortion; acoustic noise; acoustic signal processing; convergence of numerical methods; error statistics; interference suppression; learning (artificial intelligence); optimisation; parameter estimation; speech recognition; WER; channel distortion estimation; convergence rates; expectation maximisation; generalized EM framework; learning algorithm; noise adaptation; noise estimation; parameter initialization; speech cleaning; speech recognition; word error rate; Cepstral analysis; Cleaning; Convergence; Equations; Gaussian noise; Noise reduction; Nonlinear distortion; Parameter estimation; Speech enhancement; Speech recognition;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034611