Title :
Nonlinear Acoustic Echo Cancellation Based on a Sliding-Window Leaky Kernel Affine Projection Algorithm
Author :
Gil-Cacho, Jose M. ; Signoretto, Marco ; van Waterschoot, Toon ; Moonen, Marc ; Jensen, Soren Holdt
Author_Institution :
iMinds Future Health Dept., KU Leuven, Leuven, Belgium
Abstract :
Acoustic echo cancellation (AEC) is used in speech communication systems where the existence of echoes degrades the speech intelligibility. Standard approaches to AEC rely on the assumption that the echo path to be identified can be modeled by a linear filter. However, some elements introduce nonlinear distortion and must be modeled as nonlinear systems. Several nonlinear models have been used with more or less success. The kernel affine projection algorithm (KAPA) has been successfully applied to many areas in signal processing but not yet to nonlinear AEC (NLAEC). The contribution of this paper is three-fold: (1) to apply KAPA to the NLAEC problem, (2) to develop a sliding-window leaky KAPA (SWL-KAPA) that is well suited for NLAEC applications, and (3) to propose a kernel function, consisting of a weighted sum of a linear and a Gaussian kernel. In our experiment set-up, the proposed SWL-KAPA for NLAEC consistently outperforms the linear APA, resulting in up to 12 dB of improvement in ERLE at a computational cost that is only 4.6 times higher. Moreover, it is shown that the SWL-KAPA outperforms, by 4-6 dB, a Volterra-based NLAEC, which itself has a much higher 413 times computational cost than the linear APA.
Keywords :
Gaussian processes; echo suppression; filtering theory; nonlinear distortion; speech intelligibility; Gaussian kernel; NLAEC; SWL-KAPA; Volterra-based NLAEC; echo path; kernel function; linear filter; nonlinear AEC; nonlinear acoustic echo cancellation; nonlinear distortion; nonlinear systems; signal processing; sliding-window leaky kernel affine projection algorithm; speech communication systems; speech intelligibility; weighted linear sum; Kernel adaptive filters; nonlinear acoustic echo cancellation;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2260742