DocumentCode :
2646382
Title :
Time-warping neural network for phoneme recognition
Author :
Aikawa, Kiyoaki
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
fYear :
1991
fDate :
18-21 Nov 1991
Firstpage :
2122
Abstract :
The author investigates a feedforward neural network that can accept phonemes with an arbitrary duration coping with nonlinear time warping. The time-warping neural network is characterized by the time-warping functions embedded between the input layer and the first hidden layer in the network. The input layer accesses three different time points. The accessing points are determined by the time-warping functions. The input spectrum sequence itself is not warped but the accessing-point sequence is warped. The advantage of this network architecture is that the input layer can access the original spectrum sequence. The proposed network demonstrated higher phoneme recognition accuracy than the baseline recognizer based on conventional feedforward neural networks. The recognition accuracy was even higher than that achieved with discrete hidden Markov models
Keywords :
neural nets; speech recognition; accessing points; feedforward neural network; network architecture; nonlinear time warping; phoneme recognition; spectrum sequence; speech recognition; Dynamic programming; Feedforward neural networks; Feedforward systems; Heuristic algorithms; Hidden Markov models; Humans; Laboratories; Neural networks; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN :
0-7803-0227-3
Type :
conf
DOI :
10.1109/IJCNN.1991.170701
Filename :
170701
Link To Document :
بازگشت