Title :
Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction
Author :
Schmid, Daniel ; Enzner, Gerald ; Malik, S. ; Kolossa, Dorothea ; Martin, Rashad
Author_Institution :
Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Bochum, Germany
Abstract :
Room reverberation and background noise severely degrade the quality of hands-free speech communication systems. In this work, we address the problem of combined speech dereverberation and noise reduction using a variational Bayesian (VB) inference approach. Our method relies on a multichannel state-space model for the acoustic channels that combines frame-based observation equations in the frequency domain with a first-order Markov model to describe the time-varying nature of the room impulse responses. By modeling the channels and the source signal as latent random variables, we formulate a lower bound on the log-likelihood function of the model parameters given the observed microphone signals and iteratively maximize it using an online expectation-maximization approach. Our derivation yields update equations to jointly estimate the channel and source posterior distributions and the remaining model parameters. An inspection of the resulting VB algorithm for blind equalization and channel identification (VB-BENCH) reveals that the presented framework includes previously proposed methods as special cases. Finally, we evaluate the performance of our approach in terms of speech quality, adaptation times, and speech recognition results to demonstrate its effectiveness for a wide range of reverberation and noise conditions.
Keywords :
Bayes methods; Markov processes; blind equalisers; channel estimation; expectation-maximisation algorithm; noise abatement; speech processing; speech recognition; variational techniques; VB-BENCH; acoustic channels; adaptation times; blind equalization; channel identification; first-order Markov model; frame-based observation equations; frequency domain; hands-free speech communication systems; log-likelihood function; microphone signals; multichannel dereverberation; multichannel state-space model; noise reduction; online expectation-maximization approach; room impulse responses; room reverberation; source posterior distributions; speech dereverberation; speech quality; speech recognition; variational Bayesian inference; Frequency-domain analysis; Mathematical model; Microphones; Reverberation; Speech; Speech processing; Dereverberation; expectation-maximization algorithm; noise reduction; state-space model; variational Bayes;
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
DOI :
10.1109/TASLP.2014.2329732