Title :
Semi-tied covariance matrices
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
A standard problem in many classification tasks is how to model feature vectors whose elements are highly correlated. If multi-variate Gaussian distributions are used to model the data then they must have full covariance matrices to accurately do so. This requires a large number of parameters per distribution which restricts the number of distributions that may be robustly estimated, particularly when high dimensional feature vectors are required. This paper describes an alternative to full covariance matrices in these situations. An approximate full covariance matrix is used. The covariance matrix is now split into two elements, one full and one diagonal, which may be tied at completely separate levels. Typically, the full elements are extensively tied, resulting in only a small increase in the number of parameters compared to the diagonal case. Thus dramatically increasing the number of distributions that may be robustly estimated. Simple iterative re-estimation formulae for all the parameters within the standard EM framework are presented. On a large vocabulary speech recognition task a 10% reduction in word error rate over a standard system was achieved
Keywords :
Gaussian distribution; approximation theory; correlation methods; covariance matrices; error statistics; feature extraction; hidden Markov models; iterative methods; maximum likelihood estimation; pattern recognition; speech recognition; EM framework; approximate full covariance matrix; classification tasks; continuous-density HMM; correlated data modelling; diagonal elements; feature vectors; full elements; iterative re-estimation formulae; large vocabulary speech recognition; maximum likelihood estimation; multi-variate Gaussian distributions; parameters; pattern recognition; semi-tied covariance matrices; standard system; word error rate reduction; Covariance matrix; Decorrelation; Error analysis; Gaussian distribution; Hidden Markov models; Maximum likelihood estimation; Pattern recognition; Robustness; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.675350