مرکز منطقه ای اطلاع رساني علوم و فناوري - Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data

DocumentCode :

2701853

Title :

Adaptive Training with Joint Uncertainty Decoding for Robust Recognition of Noisy Data

Author :

Liao, Haitao ; Gales, Mark J.F.

Author_Institution :

Dept. of Eng., Cambridge Univ., UK

Volume :

fYear :

2007

fDate :

15-20 April 2007

Abstract :

Standard noise compensation techniques for automatic speech recognition assume a clean trained acoustic model. What is thought of as "clean" data, may still have a variety of speakers, different channels and varying noise conditions. Hence it may be more reasonable to consider such data multi-conditional for multistyle training. This paper shows that multistyle models benefit from VTS compensation or joint uncertainty decoding by reducing the mismatch between training and test. An EM-based noise estimation procedure that produces ML VTS or joint noise models is also described. Alternatively, adaptive training with joint uncertainty transforms factors out the noise from the data. The uncertainty variance bias de-weights observations in the training data where the SNR is low. This property allows data with a wide SNR range to be used and produces canonical models that truly represent clean speech, whereas multistyle trained models must account for all acoustic variation associated with different noise conditions. This paper presents joint adaptive training including formula for estimating the transforms and canonical model parameters. Experiments are conducted on the resource management and broadcast news corpora.

Keywords :

decoding; noise; speech coding; speech recognition; EM-based noise estimation; SNR; acoustic model; adaptive training; automatic speech recognition; broadcast news corpora; canonical model parameters; joint uncertainty decoding; multistyle trained models; noise compensation techniques; noisy data; resource management; robust recognition; Acoustic noise; Automatic speech recognition; Decoding; Loudspeakers; Maximum likelihood estimation; Noise robustness; Signal to noise ratio; Testing; Training data; Uncertainty; Adaptive Training; Broadcast News; Noise Robust Speech Recognition; Uncertainty Decoding;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on

Conference_Location :

Honolulu, HI

ISSN :

1520-6149

Print_ISBN :

1-4244-0727-3

Type :

conf

DOI :

10.1109/ICASSP.2007.366931

Filename :

4218119

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2701853