DocumentCode
323495
Title
Discriminative learning of additive noise and channel distortions for robust speech recognition
Author
Han, Jiqing ; Han, Munsung ; Park, Gyu-Bong ; Park, Jeongue ; Gao, Wen ; Hwang, Doosung
Author_Institution
Language Understanding Lab., ETRI, South Korea
Volume
1
fYear
1998
fDate
12-15 May 1998
Firstpage
81
Abstract
Learning the influence of additive noise and channel distortions from training data is an effective approach for robust speech recognition. Most of the previous methods are based on maximum likelihood estimation criterion. We propose a new method of discriminative learning environmental parameters, which is based on the minimum classification error (MCE) criterion. By using a simple classifier defined by ourselves and the generalized probabilistic descent (GPD) algorithm, we iteratively learn environmental parameters. After getting the parameters, we estimate the clean speech features from the observed speech features and then use the estimation of the clean speech features to train or test the back-end HMM classifier. The best error rate reduction of 32.1% is obtained, tested on a Korean 18 isolated confusion words task, relative to the conventional HMM system
Keywords
Gaussian distribution; cepstral analysis; error statistics; feature extraction; hidden Markov models; learning systems; noise; parameter estimation; pattern classification; probability; speech recognition; Gaussian distribution; Korean; additive noise; back-end HMM classifier; cepstral training sequences; channel distortions; clean speech features; discriminative learning; environmental parameters; error rate reduction; generalized probabilistic descent algorithm; isolated confusion words task; maximum likelihood estimation; minimum classification error; observed speech features; parameter estimation; robust speech recognition; training data; Additive noise; Error analysis; Hidden Markov models; Iterative algorithms; Maximum likelihood estimation; Noise robustness; Parameter estimation; Speech recognition; Testing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.674372
Filename
674372
Link To Document