Title :
Time-varying noise compensation by sequential Monte Carlo method
Author :
Yao, Kaisheng ; Nakamura, Satoshi
Author_Institution :
ATR Spoken Language Translation Res. Labs., Kyoto, Japan
Abstract :
We present a sequential Monte Carlo method applied to additive noise compensation for robust speech recognition in time-varying noise. At each frame, the method generates a set of samples, approximating the posterior distribution of speech and noise parameters for given observation sequences to the current frame. An explicit model representing noise effects on speech features is used, so that an extended Kalman filter is constructed for each sample, generating an updated continuous state as the estimation of the noise parameter, and prediction likelihood as the weight of each sample for minimum mean square error inference of the time-varying noise parameter over these samples. A selection step and a smoothing step are used to improve efficiency. Through experiments, we observed significant performance improvement over that achieved by noise compensation with a stationary noise assumption. It also performed better than the sequential EM algorithm in machine-gun noise.
Keywords :
Kalman filters; Monte Carlo methods; acoustic noise; inference mechanisms; interference suppression; least mean squares methods; parameter estimation; prediction theory; speech recognition; MMSE inference; extended Kalman filter; machine-gun noise; minimum mean square error inference; noise parameter estimation; prediction likelihood; robust speech recognition; sequential Monte Carlo method; time-varying noise compensation; Additive noise; Inference algorithms; Mean square error methods; Noise generators; Noise robustness; Predictive models; Smoothing methods; Speech enhancement; Speech recognition; State estimation;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2001. ASRU '01. IEEE Workshop on
Print_ISBN :
0-7803-7343-X
DOI :
10.1109/ASRU.2001.1034613