DocumentCode :
2701853
Title :
Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data
Author :
Liao, Haitao ; Gales, Mark J.F.
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Standard noise compensation techniques for automatic speech recognition assume a clean trained acoustic model. What is thought of as "clean" data, may still have a variety of speakers, different channels and varying noise conditions. Hence it may be more reasonable to consider such data multi-conditional for multistyle training. This paper shows that multistyle models benefit from VTS compensation or joint uncertainty decoding by reducing the mismatch between training and test. An EM-based noise estimation procedure that produces ML VTS or joint noise models is also described. Alternatively, adaptive training with joint uncertainty transforms factors out the noise from the data. The uncertainty variance bias de-weights observations in the training data where the SNR is low. This property allows data with a wide SNR range to be used and produces canonical models that truly represent clean speech, whereas multistyle trained models must account for all acoustic variation associated with different noise conditions. This paper presents joint adaptive training including formula for estimating the transforms and canonical model parameters. Experiments are conducted on the resource management and broadcast news corpora.
Keywords :
decoding; noise; speech coding; speech recognition; EM-based noise estimation; SNR; acoustic model; adaptive training; automatic speech recognition; broadcast news corpora; canonical model parameters; joint uncertainty decoding; multistyle trained models; noise compensation techniques; noisy data; resource management; robust recognition; Acoustic noise; Automatic speech recognition; Decoding; Loudspeakers; Maximum likelihood estimation; Noise robustness; Signal to noise ratio; Testing; Training data; Uncertainty; Adaptive Training; Broadcast News; Noise Robust Speech Recognition; Uncertainty Decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.366931
Filename :
4218119
Link To Document :
بازگشت