DocumentCode :
1515777
Title :
Nonlinear Compensation Using the Gauss–Newton Method for Noise-Robust Speech Recognition
Author :
Zhao, Yong ; Juang, Biing-Hwang
Author_Institution :
Dept. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA, USA
Volume :
20
Issue :
8
fYear :
2012
Firstpage :
2191
Lastpage :
2206
Abstract :
In this paper, we present the Gauss-Newton method as a unified approach to estimating noise parameters of the prevalent nonlinear compensation models, such as vector Taylor series (VTS), data-driven parallel model combination (DPMC), and unscented transform (UT), for noise-robust speech recognition. While iterative estimation of noise means in a generalized EM framework has been widely known, we demonstrate that such approaches are variants of the Gauss-Newton method. Furthermore, we propose a novel noise variance estimation algorithm that is consistent with the Gauss-Newton principle. The formulation of the Gauss-Newton method reduces the noise estimation problem to determining the Jacobians of the corrupted speech parameters. For sampling-based compensations, we present two methods, sample Jacobian average (SJA) and cross-covariance (XCOV), to evaluate these Jacobians. The proposed noise estimation algorithm is evaluated for various compensation models on two tasks. The first is to fit a Gaussian mixture model (GMM) model to artificially corrupted samples, and the second is to perform speech recognition on the Aurora 2 database. The significant performance improvements confirm the efficacy of the Gauss-Newton method to estimating the noise parameters of the nonlinear compensation models.
Keywords :
Gaussian processes; Jacobian matrices; Newton method; series (mathematics); speech recognition; Aurora 2 database; DPMC; GMM model; Gauss-Newton method; Gauss-Newton principle; Gaussian mixture model; Jacobians; SJA; UT; VTS; XCOV; corrupted speech parameters; cross-covariance; data-driven parallel model combination; generalized EM framework; iterative estimation; noise estimation algorithm; noise estimation problem; noise parameter estimation; noise variance estimation algorithm; noise-robust speech recognition; nonlinear compensation models; sample Jacobian average; sampling-based compensations; unscented transform; vector Taylor series; Adaptation models; Estimation; Jacobian matrices; Noise; Speech; Speech processing; Speech recognition; Gauss–Newton method; nonlinear compensation; robust speech recognition; vector Taylor series (VTS);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2012.2199107
Filename :
6198873
Link To Document :
بازگشت