Title :
Joint uncertainty decoding with the second order approximation for noise robust speech recognition
Author :
Xu, Haitian ; Chin, K.K.
Author_Institution :
Speech Technol. Group, Cambridge Res. Lab., Cambridge
Abstract :
Joint uncertainty decoding has recently achieved promising results by integrating the front-end uncertainty into the back-end in a mathematically consistent framework. In this paper, joint uncertainty decoding is compared with the widely used vector Taylor series (VTS). We show that the two methods are identical except that joint uncertainty decoding applies the Taylor expansion on each regression class whereas VTS applies it to each HMM mixture. The relatively rougher expansion points used in joint uncertainty decoding make it computationally cheaper than VTS but inevitably worse on recognition accuracy. To overcome this drawback, this paper proposes an improved joint uncertainty decoding algorithm which employs second-order Taylor expansion on each regression class in order to reduce the expansion errors. Special considerations are further given to limit the overall computational cost by adopting different number of regression classes for different orders in the Taylor expansion. Experiments on the Aurora 2 database show that the proposed method is able to beat VTS on recognition accuracy and computational cost with relative improvement up to 6% and 60%, respectively.
Keywords :
approximation theory; decoding; hidden Markov models; speech recognition; Aurora 2 database; front-end uncertainty; hidden Markov models; joint uncertainty decoding; noise robustness; second-order Taylor expansion; speech recognition; vector Taylor series; Acoustic noise; Automatic speech recognition; Computational efficiency; Decoding; Hidden Markov models; Noise robustness; Speech recognition; Taylor series; Uncertainty; Working environment noise; VTS; noise robustness; speech recognition; uncertainty decoding;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960465