Discriminative learning of additive noise and channel distortions for robust speech recognition

Author

Han, Jiqing ; Han, Munsung ; Park, Gyu-Bong ; Park, Jeongue ; Gao, Wen ; Hwang, Doosung

Author_Institution

Language Understanding Lab., ETRI, South Korea

Volume

1

fYear

1998

fDate

12-15 May 1998

Firstpage

81

Abstract

Learning the influence of additive noise and channel distortions from training data is an effective approach for robust speech recognition. Most of the previous methods are based on maximum likelihood estimation criterion. We propose a new method of discriminative learning environmental parameters, which is based on the minimum classification error (MCE) criterion. By using a simple classifier defined by ourselves and the generalized probabilistic descent (GPD) algorithm, we iteratively learn environmental parameters. After getting the parameters, we estimate the clean speech features from the observed speech features and then use the estimation of the clean speech features to train or test the back-end HMM classifier. The best error rate reduction of 32.1% is obtained, tested on a Korean 18 isolated confusion words task, relative to the conventional HMM system

Keywords

Gaussian distribution; cepstral analysis; error statistics; feature extraction; hidden Markov models; learning systems; noise; parameter estimation; pattern classification; probability; speech recognition; Gaussian distribution; Korean; additive noise; back-end HMM classifier; cepstral training sequences; channel distortions; clean speech features; discriminative learning; environmental parameters; error rate reduction; generalized probabilistic descent algorithm; isolated confusion words task; maximum likelihood estimation; minimum classification error; observed speech features; parameter estimation; robust speech recognition; training data; Additive noise; Error analysis; Hidden Markov models; Iterative algorithms; Maximum likelihood estimation; Noise robustness; Parameter estimation; Speech recognition; Testing; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on

Conference_Location

Seattle, WA

ISSN

1520-6149

Print_ISBN

0-7803-4428-6

Type

conf

DOI

10.1109/ICASSP.1998.674372

Filename

674372