DocumentCode :
3331793
Title :
Abilities and limitations of a neural network model for spoken work recognition
Author :
Kurogi, Shuichi
Author_Institution :
Dept. of Control Eng., Kyushu Inst. of Technol., Kitakyushu, Japan
fYear :
1988
fDate :
24-27 July 1988
Firstpage :
205
Abstract :
A neural-network model for spoken-word recognition, utilizing information on loudness, is presented. The network is a discrete and simplified version of the model neural network for spatiotemporal pattern recognition presented by S. Kurogi (1987). It combines two characteristic functions: short-term storage of a kind in a synapse, and a maximum detection function. A nonnegative real number vector is regarded as the spectra pattern of a phoneme, the amplitude of a vector as loudness, and a string of vectors as a spoken work. Abilities and limitations of the network are analyzed mathematically. A time-warped word is identified as its original work if the loudness of its constituent phonemes is adjusted adequately, and successive words are segmented and recognized correctly if the loudness of the words is adjusted adequately. It is shown that the model has valid supports in physiological findings. The effectiveness of the model as an algorithm for speech recognition is also shown.<>
Keywords :
natural languages; neural nets; speech recognition; discrete network; loudness; maximum detection function; neural network model; nonnegative real number vector; phoneme; physiology; short-term storage; spatiotemporal pattern recognition; spectra pattern; spoken work recognition; synapse; time-warped word; vector amplitude; vector string; Natural languages; Neural networks; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1988., IEEE International Conference on
Conference_Location :
San Diego, CA, USA
Type :
conf
DOI :
10.1109/ICNN.1988.23930
Filename :
23930
Link To Document :
بازگشت