DocumentCode :
323478
Title :
Exploiting acoustic feature correlations by joint neural vector quantizer design in a discrete HMM system
Author :
Neukirchen, Christoph ; Willett, Daniel ; Eickeler, Stefan ; Müller, Stefan
Author_Institution :
Dept. of Comput. Sci, Gerhard-Mercator Univ., Duisburg, Germany
Volume :
1
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
5
Abstract :
In previous work about hybrid speech recognizers with discrete HMMs we have shown that VQs, that are trained according to a maximum mutual information (MMI) criterion, are well suited for ML estimated Bayes classifiers. This is only valid for single VQ systems. In this paper we extend the theory to speech recognizers with multiple VQs. This leads to a joint training criterion for arbitrary multiple neural VQs that considers the inter VQ correlation during parameter estimation. The idea of a gradient based joint training method is derived. Experimental results indicate that inter VQ correlations can cause some degradation of recognition performance. The joint multiple VQ training decorrelates the quantizer labels and improves system performance. In addition the new training criterion allows for a less careful way of splitting up the feature vector into multiple streams that do not have to be statistically independent. In particular the use of highly correlated features in conjunction with the novel training criterion in the experiments leads to important gains in recognition performance for the speaker independent resource management database and gives the lowest error rate of 5.0% we ever obtained in this framework
Keywords :
correlation methods; hidden Markov models; learning (artificial intelligence); neural nets; parameter estimation; speech recognition; vector quantisation; ML estimated Bayes classifiers; VQ; acoustic feature correlations; discrete HMM system; feature vector; gradient based joint training method; highly correlated features; hybrid speech recognizers; inter VQ correlation; joint neural vector quantizer design; joint training criterion; maximum mutual information criterion; multiple VQ training; parameter estimation; speaker independent resource management database; speech recognition; Decorrelation; Degradation; Management training; Maximum likelihood estimation; Mutual information; Parameter estimation; Performance gain; Resource management; Speech recognition; System performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.674353
Filename :
674353
Link To Document :
بازگشت