Title :
Phoneme recognition by phoneme filter neural networks
Author :
Nakamura, Masami ; Tamura, Shin´ichi ; Sagayama, Shigeki
Author_Institution :
Sumitomo Metal Ind., Ltd., Japan
Abstract :
A phoneme filter neural network (PFN) approach to vowel recognition is described. The PFN is a multilayer neural network with fewer hidden units than input units prepared for each of the phoneme categories. Each network is trained as identity mapping by speech data belonging to one phoneme category. In the recognition process, the similarity between the input data and output data is computed for each network. The results of an experiment involving the Japanese vowel recognition task showed that the PFN recognition rates for the top two or more choices are higher than those of a conventional three-layer neural network and the PFN outputs represented candidate likelihoods. It was also confirmed that the PFN has a mapping ability and recognition performance superior to those of the linear K-L transformation method because of the nonlinearity of the PFN
Keywords :
neural nets; speech recognition; Japanese vowel recognition; candidate likelihoods; hidden units; identity mapping; input units; multilayer neural network; output data; phoneme filter neural networks; speech data; speech recognition; Computer industry; Computer networks; Filters; High performance computing; Multi-layer neural network; Neural networks; Pattern recognition; Software engineering; Speech recognition; Telephony;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150284