DocumentCode :
3164918
Title :
Factor analysis based VTS discriminative adaptive training
Author :
Flego, F. ; Gales, M.J.F.
Author_Institution :
Eng. Dept., Cambridge Univ., Cambridge, UK
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4669
Lastpage :
4672
Abstract :
Vector Taylor Series (VTS) model based compensation is a powerful approach for noise robust speech recognition. An important extension to this approach is VTS adaptive training (VAT), which allows canonical models to be estimated on diverse noise-degraded training data. These canonical model can be estimated using EM-based approaches, allowing simple extensions to discriminative VAT (DVAT). However to ensure a diagonal corrupted speech covariance matrix the Jacobian (loading matrix) relating the noise and clean speech is diagonalised. In this work an approach for yielding optimal diagonal loading matrices based on minimising the expected KL-divergence between the diagonal loading matrix and “correct” distributions is proposed. The performance of DVAT using the standard and optimal diagonalisation was evaluated on both in-car collected data and the Aurora4 task.
Keywords :
Jacobian matrices; covariance matrices; noise; speech recognition; training; vectors; Aurora4 task; DVAT; EM-based approach; Jacobian; KL-divergence; canonical models; clean speech; diagonal corrupted speech covariance matrix; discriminative VAT; factor analysis based VTS discriminative adaptive training; in-car collected data; loading matrix; noise robust speech recognition; noise speech; optimal diagonal loading matrices; powerful approach; vector taylor series model based compensation; Adaptation models; Jacobian matrices; Load modeling; Loading; Noise; Speech; Training; Speech recognition; adaptive training; generative processes; noise robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288960
Filename :
6288960
Link To Document :
بازگشت