DocumentCode :
1910338
Title :
Multiple neural network topologies applied to keyword spotting
Author :
Morgan, David P. ; Scofield, Christopher L. ; Adcock, John E.
Author_Institution :
Lockheed Sanders Inc., Nashua, NH, USA
fYear :
1991
fDate :
14-17 Apr 1991
Firstpage :
313
Abstract :
The authors describe several experiments in which the use of artificial neural networks (ANNs) for the continuous speech speaker-independent keyword recognition problem was investigated. They discuss methodologies for reducing a primary keyword spotting system´s susceptibility to false alarms while maintaining recognition accuracy. The keyword spotter uses a conventional dynamic time warping algorithm to detect the start- and end-point of each potential keyword. The ANNs serve as a secondary processing stage for this segmented utterance. The ANNs attempt to classify this utterance by formulating the recognition problem as a pattern matching problem. In the hybrid network experiments, the utterance was processed into features derived from the activation at the hidden layer of a back-propagation trained network. Hybrid representations were grouped with two other feature representations in a multiple neural network system. A recognition accuracy of 78% on the Stonehenge X database was obtained while rejecting 72% of the false alarms which were detected by the primary keyword spotting system
Keywords :
neural nets; speech recognition; Fourier transforms; Stonehenge X database; artificial neural networks; back-propagation trained network; continuous speech speaker-independent keyword recognition; dynamic time warping algorithm; end-point; false alarms; hidden layer; hybrid network; multiple neural network system; pattern matching problem; primary keyword spotting; recognition accuracy; segmented utterance; speech recognition; start point; Artificial neural networks; Heuristic algorithms; Multi-layer neural network; Network topology; Neural networks; Pattern recognition; Signal processing; Speech processing; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
ISSN :
1520-6149
Print_ISBN :
0-7803-0003-3
Type :
conf
DOI :
10.1109/ICASSP.1991.150339
Filename :
150339
Link To Document :
بازگشت