• DocumentCode
    3331793
  • Title

    Abilities and limitations of a neural network model for spoken work recognition

  • Author

    Kurogi, Shuichi

  • Author_Institution
    Dept. of Control Eng., Kyushu Inst. of Technol., Kitakyushu, Japan
  • fYear
    1988
  • fDate
    24-27 July 1988
  • Firstpage
    205
  • Abstract
    A neural-network model for spoken-word recognition, utilizing information on loudness, is presented. The network is a discrete and simplified version of the model neural network for spatiotemporal pattern recognition presented by S. Kurogi (1987). It combines two characteristic functions: short-term storage of a kind in a synapse, and a maximum detection function. A nonnegative real number vector is regarded as the spectra pattern of a phoneme, the amplitude of a vector as loudness, and a string of vectors as a spoken work. Abilities and limitations of the network are analyzed mathematically. A time-warped word is identified as its original work if the loudness of its constituent phonemes is adjusted adequately, and successive words are segmented and recognized correctly if the loudness of the words is adjusted adequately. It is shown that the model has valid supports in physiological findings. The effectiveness of the model as an algorithm for speech recognition is also shown.<>
  • Keywords
    natural languages; neural nets; speech recognition; discrete network; loudness; maximum detection function; neural network model; nonnegative real number vector; phoneme; physiology; short-term storage; spatiotemporal pattern recognition; spectra pattern; spoken work recognition; synapse; time-warped word; vector amplitude; vector string; Natural languages; Neural networks; Speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 1988., IEEE International Conference on
  • Conference_Location
    San Diego, CA, USA
  • Type

    conf

  • DOI
    10.1109/ICNN.1988.23930
  • Filename
    23930