مرکز منطقه ای اطلاع رساني علوم و فناوري - Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction

DocumentCode :

43350

Title :

Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction

Author :

Schmid, Daniel ; Enzner, Gerald ; Malik, S. ; Kolossa, Dorothea ; Martin, Rashad

Author_Institution :

Inst. of Commun. Acoust., Ruhr-Univ. Bochum, Bochum, Germany

Volume :

Issue :

fYear :

2014

fDate :

Aug. 2014

Firstpage :

1320

Lastpage :

1335

Abstract :

Room reverberation and background noise severely degrade the quality of hands-free speech communication systems. In this work, we address the problem of combined speech dereverberation and noise reduction using a variational Bayesian (VB) inference approach. Our method relies on a multichannel state-space model for the acoustic channels that combines frame-based observation equations in the frequency domain with a first-order Markov model to describe the time-varying nature of the room impulse responses. By modeling the channels and the source signal as latent random variables, we formulate a lower bound on the log-likelihood function of the model parameters given the observed microphone signals and iteratively maximize it using an online expectation-maximization approach. Our derivation yields update equations to jointly estimate the channel and source posterior distributions and the remaining model parameters. An inspection of the resulting VB algorithm for blind equalization and channel identification (VB-BENCH) reveals that the presented framework includes previously proposed methods as special cases. Finally, we evaluate the performance of our approach in terms of speech quality, adaptation times, and speech recognition results to demonstrate its effectiveness for a wide range of reverberation and noise conditions.

Keywords :

Bayes methods; Markov processes; blind equalisers; channel estimation; expectation-maximisation algorithm; noise abatement; speech processing; speech recognition; variational techniques; VB-BENCH; acoustic channels; adaptation times; blind equalization; channel identification; first-order Markov model; frame-based observation equations; frequency domain; hands-free speech communication systems; log-likelihood function; microphone signals; multichannel dereverberation; multichannel state-space model; noise reduction; online expectation-maximization approach; room impulse responses; room reverberation; source posterior distributions; speech dereverberation; speech quality; speech recognition; variational Bayesian inference; Frequency-domain analysis; Mathematical model; Microphones; Reverberation; Speech; Speech processing; Dereverberation; expectation-maximization algorithm; noise reduction; state-space model; variational Bayes;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE/ACM Transactions on

Publisher :

ieee

ISSN :

2329-9290

Type :

jour

DOI :

10.1109/TASLP.2014.2329732

Filename :

6827932

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=43350