مرکز منطقه ای اطلاع رساني علوم و فناوري - Cantonese syllable recognition using neural networks

DocumentCode :

1522821

Title :

Cantonese syllable recognition using neural networks

Author :

Tan Lee ; Ching, P.C.

Author_Institution :

Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin

Volume :

Issue :

fYear :

1999

fDate :

7/1/1999 12:00:00 AM

Firstpage :

466

Lastpage :

472

Abstract :

This work describes a novel neural network based speech recognition system for isolated Cantonese syllables. Since Cantonese is a monosyllabic and tonal language, the recognition system is composed of two major components, namely the tone recognizer and the base syllable recognizer. The tone recognizer adopts the architecture of multilayer perceptron in which each output neuron represents a particular tone. The base syllable recognizer consists of a large number of independently trained recurrent networks, each representing a designated Cantonese syllable. An integrated recognition algorithm is developed to give the ultimate recognition results based on N-best outputs of the two subrecognizers. To demonstrate the effectiveness of the proposed methods, a speaker-dependent recognition system has been built with the vocabulary expanding progressively from 10 syllables to 200 syllables. In the case of 200 syllables, a top-1 recognition accuracy of 81.8% has been attained whilst the top-3 accuracy is 95.28

Keywords :

multilayer perceptrons; natural languages; neural net architecture; recurrent neural nets; speech recognition; Cantonese syllable recognition; base syllable recognizer; independently trained recurrent networks; integrated recognition algorithm; isolated Cantonese syllables; monosyllabic language; multilayer perceptron architecture; neural networks; output neuron; speaker-dependent recognition system; speech recognition system; tonal language; tone recognizer; top-1 recognition accuracy; top-3 accuracy; vocabulary; Hidden Markov models; Humans; Loudspeakers; Multilayer perceptrons; Natural languages; Neural networks; Neurons; Pattern recognition; Speech recognition; Vocabulary;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.771317

Filename :

771317

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1522821