Title :
Factor analysis based VTS discriminative adaptive training
Author :
Flego, F. ; Gales, M.J.F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
Abstract :
Vector Taylor Series (VTS) model based compensation is a powerful approach for noise robust speech recognition. An important extension to this approach is VTS adaptive training (VAT), which allows canonical models to be estimated on diverse noise-degraded training data. These canonical model can be estimated using EM-based approaches, allowing simple extensions to discriminative VAT (DVAT). However to ensure a diagonal corrupted speech covariance matrix the Jacobian (loading matrix) relating the noise and clean speech is diagonalised. In this work an approach for yielding optimal diagonal loading matrices based on minimising the expected KL-divergence between the diagonal loading matrix and “correct” distributions is proposed. The performance of DVAT using the standard and optimal diagonalisation was evaluated on both in-car collected data and the Aurora4 task.
Keywords :
Jacobian matrices; covariance matrices; noise; speech recognition; training; vectors; Aurora4 task; DVAT; EM-based approach; Jacobian; KL-divergence; canonical models; clean speech; diagonal corrupted speech covariance matrix; discriminative VAT; factor analysis based VTS discriminative adaptive training; in-car collected data; loading matrix; noise robust speech recognition; noise speech; optimal diagonal loading matrices; powerful approach; vector taylor series model based compensation; Adaptation models; Jacobian matrices; Load modeling; Loading; Noise; Speech; Training; Speech recognition; adaptive training; generative processes; noise robustness;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288960