Title :
Robust speech recognition using a Small Power Boosting algorithm
Author :
Kim, Chanwoo ; Kumar, Kshitiz ; Stern, Richard M.
Author_Institution :
Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA
fDate :
Nov. 13 2009-Dec. 17 2009
Abstract :
In this paper, we present a noise robustness algorithm called small power boosting (SPB). We observe that in the spectral domain, time-frequency bins with smaller power are more affected by additive noise. The conventional way of handling this problem is estimating the noise from the test utterance and doing normalization or subtraction. In our work, in contrast, we intentionally boost the power of time-frequency bins with small energy for both the training and testing datasets. Since time-frequency bins with small power no longer exist after this power boosting, the spectral distortion between the clean and corrupt test sets becomes reduced. This type of small power boosting is also highly related to physiological nonlinearity. We observe that when small power boosting is done, suitable weighting smoothing becomes highly important. Our experimental results indicate that this simple idea is very helpful for very difficult noisy environments such as corruption by background music.
Keywords :
signal denoising; smoothing methods; speech recognition; time-frequency analysis; additive noise; noise estimation; physiological nonlinearity; robust speech recognition; small power boosting algorithm; spectral distortion; spectral domain; time-frequency bins; Additive noise; Boosting; Equations; Hidden Markov models; Robustness; Smoothing methods; Speech recognition; Testing; Time frequency analysis; Working environment noise; Robust speech recognition; physiological modeling; rate-level curve; weight smoothing;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on
Conference_Location :
Merano
Print_ISBN :
978-1-4244-5478-5
Electronic_ISBN :
978-1-4244-5479-2
DOI :
10.1109/ASRU.2009.5373230