Title :
Extended VTS for noise-robust speech recognition
Author :
van Dalen, R.C. ; Gales, M.J.F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge
Abstract :
Model compensation is a standard way of improving speech recognisers´ robustness to noise. Currently popular schemes are based on vector Taylor series (VTS) compensation. They often use the continuous time approximation to compensate dynamic parameters. In this paper, the accuracy of dynamic parameter compensation is improved by representing the dynamic features as a linear transformation of a window of static features. A modified version of VTS compensation is applied to the distribution of the window of static features and, importantly, their correlations. These compensated distributions are then transformed to standard static and dynamic distributions. The proposed scheme outperformed the standard VTS scheme by about 10% relative.
Keywords :
acoustic noise; speech recognition; continuous time approximation; dynamic distribution; dynamic parameter compensation; extended VTS; model compensation; noise-robust speech recognition; static distribution; vector Taylor series compensation; Acoustic noise; Additive noise; Background noise; Distributed computing; Hidden Markov models; Noise robustness; Speech enhancement; Speech recognition; Taylor series; Vectors; Speech recognition; acoustic noise; robustness;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4960462