Title :
Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition
Author :
Parssinen, Kimmo ; Salmela, Petri ; Harju, Mikko ; Kiss, Imre
Author_Institution :
Tampere University of Technology, Institute of Digital and Computer Systems, P.O.Box 553, FIN-33101, Finland
Abstract :
In this paper, two techniques are researched for Jacobian adaptation (JA) in the presence of additive noise. Since the original concept of JA was presented only for static cepstral coefficients, the performance of JA is researched when it is extended to cover also the delta cepstrum. However, this extension or the original concept can not provide accurate recognition performance when the mismatch between the training and recognition environments is out of the linear range of JA. Hence, this problem can be alleviated to some extent by dividing JA into two steps. At first, the adaptation is done e.g. from clean to the target environment having “high” SNR level. After that, the new JA matrixes are calculated and they are used in the second step to adapt the system to the lower target SNR leve1. Both of the above adaptation methods have been compared to cepstral mean normalization (CMN) and parallel model combination (PMC) in isolated word recognition task having a vocabulary of 200 English words. The best performance was achieved with PMC but JA showed comparable performance to CMN and outperformed it when JA was done in two steps from SNR of 25 dB to 5 dB. The system was tested with SpeechDat(II) database by adding noise recorded inside a car to the test set utterances at various SNR levels.
Keywords :
Adaptation model; Hidden Markov models; Jacobian matrices; Noise measurement; Signal to noise ratio; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5743687