DocumentCode :
3245302
Title :
Optimal filtering of noisy cepstral coefficients for robust ASR
Author :
Myrvoll, T.A. ; Nakamura, Satoshi
Author_Institution :
Spoken Language Translation Res. Lab., ATR, Kyoto, Japan
fYear :
2003
fDate :
30 Nov.-3 Dec. 2003
Firstpage :
381
Lastpage :
386
Abstract :
In this work we investigate the use of a technique for optimal, in the mean-square-error sense, filtering of noisy cepstral coefficients for use with robust ASR. The filtering is done in the log-spectral domain using a clean speech model and a noise model whose parameters are estimated by a nonapproximative maximum likelihood formulation. As the assumption of additive noise in the time and spectral domains leads to a highly nonlinear mixing function in the log-spectral domain, we have to resort to numerical integration routines to perform the estimation and filtering. To make sure that the numerical integrals are robust and accurate we develop closed form solutions that cover critical parts of the integration domain.
Keywords :
cepstral analysis; filtering theory; integral equations; integration; maximum likelihood estimation; mean square error methods; speech recognition; additive noise; automatic speech recognition; clean speech model; closed form solutions; log-spectral domain; mean square error; noise model; noisy cepstral coefficients; nonapproximative maximum likelihood formulation; nonlinear mixing function; numerical integrals; numerical integration routines; optimal filtering; parameter estimation; robust ASR; Additive noise; Automatic speech recognition; Cepstral analysis; Closed-form solution; Filtering; Laboratories; Maximum likelihood estimation; Natural languages; Noise robustness; Speech enhancement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN :
0-7803-7980-2
Type :
conf
DOI :
10.1109/ASRU.2003.1318471
Filename :
1318471
Link To Document :
بازگشت