مرکز منطقه ای اطلاع رساني علوم و فناوري - Cepstrum-domain model combination based on decomposition of speech and noise for noisy speech recognition

DocumentCode :

542202

Title :

Cepstrum-domain model combination based on decomposition of speech and noise for noisy speech recognition

Author :

Kim, H.K. ; Rose, R.C.

Author_Institution :

AT&T Labs-Research, .Florham Park, NJ., USA

Volume :

fYear :

2002

fDate :

13-17 May 2002

Abstract :

We propose a cepstrum-domain model combination method for automatic speech recognition in noisy environments. The distinguishing aspect of the method is that noise-corrupted speech is decomposed into clean speech and noise components directly in the cepstrum domain without having to transform to the linear spectrum domain as is necessary for many existing model combination approaches. This is accomplished by exploiting the properties of the minimum mean squared error-log spectral amplitude (MMSE-LSA) based speech enhancement algorithm. As a result, a clean speech hidden Markov model (HMM) is easily compensated for a noise-corrupted domain by adding the means and covariance matrices of the clean speech HMM and those of an estimated noise model. The complexity of the proposed model combination procedure is significantly reduced with respect to conventional parallel model combination. The procedure was applied to a noisy connected digit recognition task. A 40% reduction in word error rate was achieved when it was combined with acoustic feature compensation techniques under mismatched environmental and channel conditions.

Keywords :

Cepstrum; Covariance matrix; Hidden Markov models; Magnetic heads; Signal to noise ratio; Speech; Speech coding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on

Conference_Location :

Orlando, FL

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.2002.5743691

Filename :

5743691

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=542202