Title :
Back-propagation training of a neural network for word spotting
Author :
English, Thomas M. ; Boggess, Lois C.
Author_Institution :
Dept. of Comput. Sci., Texas Tech Univ., Lubbock, TX, USA
Abstract :
An approach to backpropagation training of a neural network for word spotting is described. It is assumed that the network has one output unit for each keyword to be detected, and that features of the speech signal are input at fixed intervals. The goal of training is to obtain a network that emits a detection pulse at the appropriate output unit when the utterance of a keyword is completed. The authors have developed a successful backpropagation strategy which incorporates `don´t care´ targets for outputs expected to be in the process of rising or falling, propagation of errors for only a subset of those times at which no detection pulse is expected, iterative refinement of the temporal placement of target outputs, and use of a super-squared error criterion. In an application of the strategy to speaker-dependent, continuous digit recognition (i.e., digit spotting with no utterances of nondigits), word-error rates of 0% and 2.5% were achieved for the training and test utterances, respectively
Keywords :
backpropagation; feedforward neural nets; learning (artificial intelligence); speech recognition; backpropagation training; continuous digit recognition; detection pulse; errors propagation; keyword; neural network; word spotting; word-error rates; Airplanes; Computer science; Feedforward systems; Information retrieval; Neural networks; Signal generators; Signal processing; Speech; Testing; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.226046