DocumentCode
2409157
Title
A neural network model for spoken word recognition
Author
Tsai, Hsien-Leing ; Lee, Shie-Jue
Author_Institution
Dept. of Electr. Eng., Nat. Sun Yat-Sen Univ., Kaohsiung, Taiwan
Volume
5
fYear
1997
fDate
12-15 Oct 1997
Firstpage
4029
Abstract
In this paper, we propose a neural approach to the speaker-independent word recognition, based on the algorithms of dynamic time warping (DTW) and fuzzy ARTMAP. DTW has some drawbacks: (1) It is space and time consuming for a large set of training patterns. (2) It gives an equal importance to each frame of a pattern. To obtain better performance, the training patterns need to be prefiltered by human experts. Our approach attempts to address these shortcomings of DTW. We use a modified Fuzzy ARTMAP to be the framework of our approach. Our architecture is a four-layer sequential neural network. Our training algorithm and recalling algorithm are similar to fuzzy ARTMAP. However, our neural approach is a sequential algorithm. Experiments on the recognition of English alphabets have been performed. The recognition rates obtained by our approach and DTW are 87% and 80%, respectively, while memory space used in our approach is two or three times smaller than that used in DTW. Furthermore, prefiltering on training patterns is not required
Keywords
ART neural nets; neural nets; speech recognition; time warp simulation; dynamic time warping; fuzzy ARTMAP; neural network model; speaker-independent; spoken word recognition; word recognition; Cepstrum; Heuristic algorithms; Humans; Linear predictive coding; Neural networks; Pattern recognition; Recurrent neural networks; Speech processing; Speech recognition; Wounds;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1997. Computational Cybernetics and Simulation., 1997 IEEE International Conference on
Conference_Location
Orlando, FL
ISSN
1062-922X
Print_ISBN
0-7803-4053-1
Type
conf
DOI
10.1109/ICSMC.1997.637289
Filename
637289
Link To Document