DocumentCode :
394367
Title :
Maximum likelihood training of subspaces for inverse covariance modeling
Author :
Visweswariah, K. ; Olsen, P. ; Gopinath, R. ; Axelrod, S.
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
Speech recognition systems typically use mixtures of diagonal Gaussians to model the acoustics. Using Gaussians with a more general covariance structure can give improved performance; EM-LLT and SPAM models give improvements by restricting the inverse covariance to a linear/affine subspace spanned by rank one and full rank matrices respectively. We consider training these subspaces to maximize likelihood. For EMLLT ML training the subspace results in significant gains over the scheme proposed by Olsen and Gopinath (see Proceedings of ICASSP, 2002). For SPAM ML training of the subspace slightly improves performance over the method reported by Axelrod, Gopinath and Olsen (see Proceedings of ICSLP, 2002). For the same subspace size an EMLLT model is more efficient computationally than a SPAM model, while the SPAM model is more accurate. This paper proposes a hybrid method of structuring the inverse covariances that both has good accuracy and is computationally efficient.
Keywords :
Gaussian processes; acoustic signal processing; covariance analysis; hidden Markov models; inverse problems; matrix algebra; maximum likelihood estimation; optimisation; speech recognition; EMLLT ML training; EMLLT model; ML parameter estimation; SPAM ML training; SPAM model; acoustics modelling; computationally efficient method; diagonal Gaussians; full rank matrix; hybrid method; inverse covariance; inverse covariance modeling; inverse covariances; linear/affine subspace; maximum likelihood linear transform; maximum likelihood training; numerical optimization; rank one matrix; speech recognition systems; subspace size; subspaces training; Computational efficiency; Costs; Covariance matrix; Gaussian processes; Hidden Markov models; Inverse problems; Maximum likelihood estimation; Speech recognition; Symmetric matrices; Unsolicited electronic mail;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198914
Filename :
1198914
Link To Document :
بازگشت