DocumentCode :
323495
Title :
Discriminative learning of additive noise and channel distortions for robust speech recognition
Author :
Han, Jiqing ; Han, Munsung ; Park, Gyu-Bong ; Park, Jeongue ; Gao, Wen ; Hwang, Doosung
Author_Institution :
Language Understanding Lab., ETRI, South Korea
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
81
Abstract :
Learning the influence of additive noise and channel distortions from training data is an effective approach for robust speech recognition. Most of the previous methods are based on maximum likelihood estimation criterion. We propose a new method of discriminative learning environmental parameters, which is based on the minimum classification error (MCE) criterion. By using a simple classifier defined by ourselves and the generalized probabilistic descent (GPD) algorithm, we iteratively learn environmental parameters. After getting the parameters, we estimate the clean speech features from the observed speech features and then use the estimation of the clean speech features to train or test the back-end HMM classifier. The best error rate reduction of 32.1% is obtained, tested on a Korean 18 isolated confusion words task, relative to the conventional HMM system
Keywords :
Gaussian distribution; cepstral analysis; error statistics; feature extraction; hidden Markov models; learning systems; noise; parameter estimation; pattern classification; probability; speech recognition; Gaussian distribution; Korean; additive noise; back-end HMM classifier; cepstral training sequences; channel distortions; clean speech features; discriminative learning; environmental parameters; error rate reduction; generalized probabilistic descent algorithm; isolated confusion words task; maximum likelihood estimation; minimum classification error; observed speech features; parameter estimation; robust speech recognition; training data; Additive noise; Error analysis; Hidden Markov models; Iterative algorithms; Maximum likelihood estimation; Noise robustness; Parameter estimation; Speech recognition; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674372
Filename :
674372
Link To Document :
بازگشت