Title :
A neural network model for spoken word recognition
Author :
Tsai, Hsien-Leing ; Lee, Shie-Jue
Author_Institution :
Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
Abstract :
In this paper, we propose a neural approach to the speaker-independent word recognition, based on the algorithms of dynamic time warping (DTW) and fuzzy ARTMAP. DTW has some drawbacks: (1) It is space and time consuming for a large set of training patterns. (2) It gives an equal importance to each frame of a pattern. To obtain better performance, the training patterns need to be prefiltered by human experts. Our approach attempts to address these shortcomings of DTW. We use a modified Fuzzy ARTMAP to be the framework of our approach. Our architecture is a four-layer sequential neural network. Our training algorithm and recalling algorithm are similar to fuzzy ARTMAP. However, our neural approach is a sequential algorithm. Experiments on the recognition of English alphabets have been performed. The recognition rates obtained by our approach and DTW are 87% and 80%, respectively, while memory space used in our approach is two or three times smaller than that used in DTW. Furthermore, prefiltering on training patterns is not required
Keywords :
ART neural nets; neural nets; speech recognition; time warp simulation; dynamic time warping; fuzzy ARTMAP; neural network model; speaker-independent; spoken word recognition; word recognition; Cepstrum; Heuristic algorithms; Humans; Linear predictive coding; Neural networks; Pattern recognition; Recurrent neural networks; Speech processing; Speech recognition; Wounds;
Conference_Titel :
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
0-7803-4053-1
DOI :
10.1109/ICSMC.1997.637289