مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speech recognition using a Small Power Boosting algorithm

DocumentCode :

2970353

Title :

Robust speech recognition using a Small Power Boosting algorithm

Author :

Kim, Chanwoo ; Kumar, Kshitiz ; Stern, Richard M.

Author_Institution :

Language Technol. Inst., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear :

2009

fDate :

Nov. 13 2009-Dec. 17 2009

Firstpage :

243

Lastpage :

248

Abstract :

In this paper, we present a noise robustness algorithm called small power boosting (SPB). We observe that in the spectral domain, time-frequency bins with smaller power are more affected by additive noise. The conventional way of handling this problem is estimating the noise from the test utterance and doing normalization or subtraction. In our work, in contrast, we intentionally boost the power of time-frequency bins with small energy for both the training and testing datasets. Since time-frequency bins with small power no longer exist after this power boosting, the spectral distortion between the clean and corrupt test sets becomes reduced. This type of small power boosting is also highly related to physiological nonlinearity. We observe that when small power boosting is done, suitable weighting smoothing becomes highly important. Our experimental results indicate that this simple idea is very helpful for very difficult noisy environments such as corruption by background music.

Keywords :

signal denoising; smoothing methods; speech recognition; time-frequency analysis; additive noise; noise estimation; physiological nonlinearity; robust speech recognition; small power boosting algorithm; spectral distortion; spectral domain; time-frequency bins; Additive noise; Boosting; Equations; Hidden Markov models; Robustness; Smoothing methods; Speech recognition; Testing; Time frequency analysis; Working environment noise; Robust speech recognition; physiological modeling; rate-level curve; weight smoothing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2009. ASRU 2009. IEEE Workshop on

Conference_Location :

Merano

Print_ISBN :

978-1-4244-5478-5

Electronic_ISBN :

978-1-4244-5479-2

Type :

conf

DOI :

10.1109/ASRU.2009.5373230

Filename :

5373230

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2970353