• DocumentCode
    2409157
  • Title

    A neural network model for spoken word recognition

  • Author

    Tsai, Hsien-Leing ; Lee, Shie-Jue

  • Author_Institution
    Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
  • Volume
    5
  • fYear
    1997
  • fDate
    12-15 Oct 1997
  • Firstpage
    4029
  • Abstract
    In this paper, we propose a neural approach to the speaker-independent word recognition, based on the algorithms of dynamic time warping (DTW) and fuzzy ARTMAP. DTW has some drawbacks: (1) It is space and time consuming for a large set of training patterns. (2) It gives an equal importance to each frame of a pattern. To obtain better performance, the training patterns need to be prefiltered by human experts. Our approach attempts to address these shortcomings of DTW. We use a modified Fuzzy ARTMAP to be the framework of our approach. Our architecture is a four-layer sequential neural network. Our training algorithm and recalling algorithm are similar to fuzzy ARTMAP. However, our neural approach is a sequential algorithm. Experiments on the recognition of English alphabets have been performed. The recognition rates obtained by our approach and DTW are 87% and 80%, respectively, while memory space used in our approach is two or three times smaller than that used in DTW. Furthermore, prefiltering on training patterns is not required
  • Keywords
    ART neural nets; neural nets; speech recognition; time warp simulation; dynamic time warping; fuzzy ARTMAP; neural network model; speaker-independent; spoken word recognition; word recognition; Cepstrum; Heuristic algorithms; Humans; Linear predictive coding; Neural networks; Pattern recognition; Recurrent neural networks; Speech processing; Speech recognition; Wounds;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-4053-1
  • Type

    conf

  • DOI
    10.1109/ICSMC.1997.637289
  • Filename
    637289