Title :
Time-warping neural network for phoneme recognition
Author_Institution :
NTT Human Interface Lab., Tokyo, Japan
Abstract :
The author investigates a feedforward neural network that can accept phonemes with an arbitrary duration coping with nonlinear time warping. The time-warping neural network is characterized by the time-warping functions embedded between the input layer and the first hidden layer in the network. The input layer accesses three different time points. The accessing points are determined by the time-warping functions. The input spectrum sequence itself is not warped but the accessing-point sequence is warped. The advantage of this network architecture is that the input layer can access the original spectrum sequence. The proposed network demonstrated higher phoneme recognition accuracy than the baseline recognizer based on conventional feedforward neural networks. The recognition accuracy was even higher than that achieved with discrete hidden Markov models
Keywords :
neural nets; speech recognition; accessing points; feedforward neural network; network architecture; nonlinear time warping; phoneme recognition; spectrum sequence; speech recognition; Dynamic programming; Feedforward neural networks; Feedforward systems; Heuristic algorithms; Hidden Markov models; Humans; Laboratories; Neural networks; Robustness; Speech recognition;
Conference_Titel :
Neural Networks, 1991. 1991 IEEE International Joint Conference on
Print_ISBN :
0-7803-0227-3
DOI :
10.1109/IJCNN.1991.170701