DocumentCode
2634292
Title
Continuous speech recognition with the connectionist Viterbi training procedure: a summary of recent work
Author
Franzini, Michael ; Waibel, Alex ; Lee, Kai-Fu
Author_Institution
Telefonica Investigacion y Desarrollo, Madrid, Spain
fYear
1991
fDate
18-21 Nov 1991
Firstpage
1855
Abstract
Various means by which hidden Markov models (HMMs) and neural networks (NNs) can be combined for continuous speech recognition are studied. The authors describe the connectionist Viterbi training (CVT) procedure, discuss the factors most important to its design, and report its recognition performance. Several changes made to the system are reported, including: (1) the change from recurrent to non-recurrent NNs, (2) the change from Sphinx-style phone-based HMMs to word-based HMMs, (3) the addition of a corrective training procedure, and (4) the addition of an alternate model for every word. The CVT system incorporating these changes achieved 99.1% word accuracy and 98.0% string accuracy on the TI/NBS connected digits task
Keywords
Markov processes; neural nets; speech recognition; Sphinx-style phone-based HMMs; TI/NBS connected digits task; connectionist Viterbi training; continuous speech recognition; corrective training; hidden Markov models; neural networks; nonrecurrent neural nets; recognition performance; recurrent neural nets; string accuracy; word accuracy; word-based HMMs; Computer networks; Computer science; Distributed computing; Hidden Markov models; Maximum likelihood estimation; NIST; Neural networks; Rails; Speech recognition; Viterbi algorithm;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN
0-7803-0227-3
Type
conf
DOI
10.1109/IJCNN.1991.170628
Filename
170628
Link To Document