DocumentCode :
1442154
Title :
Modular recurrent neural networks for Mandarin syllable recognition
Author :
Chen, Sin-Horng ; Liao, Yuan-Fu
Author_Institution :
Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
9
Issue :
6
fYear :
1998
fDate :
11/1/1998 12:00:00 AM
Firstpage :
1430
Lastpage :
1441
Abstract :
A new modular recurrent neural network (MRNN)-based speech-recognition method that can recognize the entire vocabulary of 1280 highly confusable Mandarin syllables is proposed in this paper. The basic idea is to first split the complicated task, in both feature and temporal domains, into several much simpler subtasks involving subsyllable and tone discrimination, and then to use two weighting RNN´s to generate several dynamic weighting functions to integrate the subsolutions into a complete solution. The novelty of the proposed method lies mainly in the use of appropriate a priori linguistic knowledge of simple initial-final structures of Mandarin syllables in the architecture design of the MRNN. The resulting MRNN is therefore effective and efficient in discriminating among highly confusable Mandarin syllables. Thus both the time-alignment and scaling problems of the ANN-based approach for large-vocabulary speech-recognition can be addressed. Experimental results show that the proposed method and its extensions, the reverse-time MRNN (Rev-MRNN) and bidirection MRNN (Bi-MRNN), all outperform an advanced HMM method trained with the MCE/GPD algorithm in both recognition-rate and system complexity
Keywords :
recurrent neural nets; speech recognition; Bi-MRNN; MRNN architecture design; MRNN-based speech recognition; Mandarin syllable recognition; Rev-MRNN; bidirection MRNN; dynamic weighting functions; feature domain; initial-final structures; large-vocabulary speech recognition; linguistic knowledge; modular recurrent neural network; recognition-rate complexity; reverse-time MRNN; scaling problem; subsyllable discrimination; system complexity; temporal domain; time-alignment; tone discrimination; Artificial neural networks; Councils; Error analysis; Hidden Markov models; Mutual information; Neural networks; Pattern recognition; Recurrent neural networks; Speech recognition; Vocabulary;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.728393
Filename :
728393
Link To Document :
بازگشت