Title :
A neural network based speech recognition system for isolated Cantonese syllables
Author :
Lee, Tan ; Ching, P.C.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Abstract :
This paper describes a novel design of neural network based speech recognition system for isolated Cantonese syllables. Since Cantonese is a monosyllabic and tonal language, the recognition system consists of a tone recognizer and a base syllable recognizer. The tone recognizer adopts the architecture of a multi-layer perceptron in which each output neuron represents a particular tone. The syllable recognizer contains a large number of independently trained recurrent networks, each representing a designated Cantonese syllable. Such a modular structure provides greater flexibility to expand the system vocabulary progressively by adding new syllable models. To demonstrate the effectiveness of the proposed method, a speaker-dependent recognition system has been built with the vocabulary growing from 40 syllables to 200 syllables. In the case of 200 syllables, a top-1 recognition accuracy of 81.8% has been attained and the top-3 accuracy is 95.2%
Keywords :
multilayer perceptrons; natural languages; neural net architecture; recurrent neural nets; speech recognition; base syllable recognizer; independently trained recurrent networks; isolated Cantonese syllables; modular structure; monosyllabic language; multi-layer perceptron; neural network; speaker-dependent recognition system; speech recognition system; tonal language; tone recognizer; vocabulary expansion; Design engineering; Electronic mail; Hidden Markov models; Loudspeakers; Multilayer perceptrons; Natural languages; Neural networks; Neurons; Speech recognition; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.595491