A large phonemic time-delay neural network technique for all Mandarin consonants recognition

Author

Poo, Gee-Swee ; Ou, Yongzhen

Author_Institution

Dept. of Inf. Syst. & Comput. Sci., Nat. Univ. of Singapore, Singapore

fYear

1994

fDate

22-26 Aug 1994

Firstpage

521

Abstract

A large phonemic time-delay neural network (LPTDNN) technique has been developed to recognize all Mandarin consonants of the entire set of Chinese syllables. By modular construction, six component networks, viz. (b,d,g,ot), (l,m,n,r,ot), (j,z,zh,ot), (c,q,ch,ot), (f,s,x,h,sh,ot) and (p,t,k,ot), are formed to represent all 21 Mandarin consonants (where ot=others). The technique was thoroughly tested with 8 sets of 1157 isolated Hanyu Pinyin syllables, with 6 sets used for training and 2 sets used for testing. The overall result shows a high recognition performance with 99.2% for the inside test (i.e. testing on the datasets which were used entirely in training) and 92.7% for the outside test (i.e. testing on the remaining datasets which were never used in training)

Keywords

delays; feedforward neural nets; speech recognition; Chinese syllables; Mandarin consonants; component networks; inside test; isolated Hanyu Pinyin syllables; large phonemic time-delay neural network; modular construction; outside test; recognition performance; speech recognition; testing; training; Artificial neural networks; Computer science; Hidden Markov models; Information systems; Modular construction; Neural networks; Speech recognition; Testing; Vector quantization; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

TENCON '94. IEEE Region 10's Ninth Annual International Conference. Theme: Frontiers of Computer Technology. Proceedings of 1994

Print_ISBN

0-7803-1862-5

Type

conf

DOI

10.1109/TENCON.1994.369246

Filename

369246