Speaker-independent recognition of connected utterances using recurrent and non-recurrent neural networks

Author

Franzini, Michael A. ; Witbrock, Michael J. ; Lee, Kai-Fu

Author_Institution

Dept. of Comput. Sci., Carnegie-Mellon Univ., Pittsburgh, PA, USA

fYear

1989

fDate

0-0 1989

Firstpage

1

Abstract

Connectionist learning procedures are applied to the task of speaker-independent continuous speech recognition, creating a system which has achieved a recognition rate of 97% correct in preliminary tests on the Texas Instruments/National Bureau of Standards Connected Digits Database. Two versions of the system were implemented, both of which used four-layer backpropagation networks. One used a static (nonrecurrent) network with a history mechanism, in which the input weights were slaved together, as they are in time-delay neural networks (TDNNs), and the other used a recurrent connection structure similar to that proposed by J.L. Elman (Tech. Rep., Univ. of California, San Diego, April 1988). The final recognition accuracies produced by the two approaches were not significantly different. The networks generated and refined hypotheses about the identity of utterances over successive intervals. The hypotheses generated by the networks were used as input to a Markov-chain-based Viterbi recognizer which produced a final identification of the entire utterance.<>

Keywords

neural nets; speech recognition; backpropagation; connectionist learning; neural networks; recurrent connection structure; speaker-independent continuous speech recognition; utterances; Neural networks; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Neural Networks, 1989. IJCNN., International Joint Conference on

Conference_Location

Washington, DC, USA

Type

conf

DOI

10.1109/IJCNN.1989.118670

Filename

118670