DocumentCode
3342627
Title
Phoneme recognition using modified TDNN and a self-organizing clustering network
Author
Islam, Rafiqul ; Hiroshige, Makoto ; Miyanaga, Yoshikazu ; Tochinai, Koji
Author_Institution
Dept. of Inf. Eng., Hokkaido Univ., Sapporo, Japan
Volume
3
fYear
1995
fDate
30 Apr-3 May 1995
Firstpage
1816
Abstract
This paper presents a new approach to phoneme recognition system. A modified Time-delay Neural Network (TDNN) based on similarity vectors of clustering node information is developed for this purpose. The speech data have been analysed first by time varying ARMA-D model to have better response of its time varying characteristics. For the generation of the similarity vectors of the clustering nodes, Self-Organising Clustering process is used. To study the performance of this system, the speaker-independent recognition of the voiced explosive(stop) consonants /b,d,g/ in varying phonetic contexts is taken as the initial recognition task. This system gives a recognition rate for the stop consonants of about 84.3% for speaker independent speech data. For all these experiments, Japanese speech data is used supplied by ATR, Japan. The time taken for the training and recognition by the system can be considered reasonable
Keywords
autoregressive moving average processes; self-organising feature maps; speech recognition; Japanese speech; modified TDNN; phoneme recognition; self-organizing clustering network; similarity vectors; speaker independent speech; time varying ARMA-D model; time-delay neural network; training; voiced explosive stop consonants; Character recognition; Data analysis; Explosives; Frequency; Informatics; Neural networks; Sampling methods; Speech analysis; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Circuits and Systems, 1995. ISCAS '95., 1995 IEEE International Symposium on
Conference_Location
Seattle, WA
Print_ISBN
0-7803-2570-2
Type
conf
DOI
10.1109/ISCAS.1995.523767
Filename
523767
Link To Document