DocumentCode :
1020892
Title :
Maximum mutual information neural networks for hybrid connectionist-HMM speech recognition systems
Author :
Rigoll, Gerhard
Author_Institution :
Dept. of Comput. Sci., Duisburg Univ., Germany
Volume :
2
Issue :
1
fYear :
1994
Firstpage :
175
Lastpage :
184
Abstract :
This paper proposes a novel approach for a hybrid connectionist-hidden Markov model (HMM) speech recognition system based on the use of a neural network as vector quantizer. The neural network is trained with a new learning algorithm offering the following innovations. (1) It is an unsupervised learning algorithm for perceptron-like neural networks that are usually trained in the supervised mode. (2) Information theory principles are used as learning criteria, making the network especially suitable for combination with a HMM-based speech recognition system. (3) The neural network is not trained using the standard error-backpropagation algorithm but using instead a newly developed self-organizing learning approach. The use of the hybrid system with the neural vector quantizer results in a 25% error reduction compared with the same HMM system using a standard k-means vector quantizer. The training algorithm can be further refined by using a combination of unsupervised and supervised learning algorithms. Finally, it is demonstrated how the new learning approach can be applied to multiple-feature hybrid speech recognition systems, using a joint information theory-based optimization procedure for the multiple neural codebooks, resulting in a 30% error reduction.
Keywords :
feedforward neural nets; hidden Markov models; information theory; optimisation; self-organising feature maps; speech recognition; unsupervised learning; vector quantisation; error reduction; hidden Markov model; hybrid connectionist-HMM; information theory; maximum mutual information neural networks; multilayer neural networks; multiple neural codebooks; multiple-feature speech recognition; optimization; perceptron-like neural networks; self-organizing learning method; speech recognition systems; supervised learning algorithms; unsupervised learning algorithm; vector quantizer; Hidden Markov models; Information theory; Mutual information; Neural networks; Speech recognition; Standards development; Supervised learning; Technological innovation; Unsupervised learning; Vocabulary;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.260360
Filename :
260360
Link To Document :
بازگشت