DocumentCode :
2703560
Title :
Latent Correlation Analysis of HMM Parameters for Speech Recognition
Author :
Zhijian Ou ; Jun Luo
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Correlation between HMM parameters has been utilized for various rapid speaker adaptation, e.g. eigenvoice adaptation. The covariance matrix of the supervector which is a concatenation of all the Gaussian means in HMM, is clearly a good measure of such parameter correlation. In this paper, we propose to treat the supervector as a latent variable under HMM, and perform estimation of the hidden supervector´s covariance matrix directly from the acoustic frames using EM algorithm. In contrast to traditional methods which depend on using well-trained/adapted supervector samples, the proposed method is more theoretically sound and capable of dealing well with speaker-specific data sparseness. Moreover, the idea of conducting utterance-level correlation analysis, estimating utterance eigenvoices, and performing (unsupervised) utterance adaptation is explored. Experiments on the OGI Numbers database show that the proposed approach achieves better adaptation performance than the traditional methods, and the utterance-level correlation analysis is found to be useful.
Keywords :
correlation methods; covariance matrices; eigenvalues and eigenfunctions; expectation-maximisation algorithm; hidden Markov models; speaker recognition; EM algorithm; HMM parameters; acoustic frames; conducting utterance-level correlation analysis; covariance matrix; latent correlation analysis; rapid speaker adaptation; speaker-specific data sparseness; speech recognition; utterance eigenvoices; Acoustic measurements; Covariance matrix; Databases; Decoding; Hidden Markov models; Loudspeakers; Performance analysis; Principal component analysis; Speech analysis; Speech recognition; HMM; correlation analysis; eigenvoice;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367022
Filename :
4218210
Link To Document :
بازگشت