Title :
Recurrent neural network speech predictor based on dynamical systems approach
Author :
Varoglu, E. ; Hacioglu, K.
Author_Institution :
Dept. of Electr. & Electron. Eng., Eastern Mediterranean Univ., Magosa, Turkey
fDate :
4/1/2000 12:00:00 AM
Abstract :
A nonlinear predictive model of speech, based on the method of time delay reconstruction, is presented and approximated using a fully connected recurrent neural network (RNN) followed by a linear combiner. This novel combination of the well established approaches for speech analysis and synthesis is compared with traditional techniques within a unified framework to illustrate the advantages of using an RNN. Extensive simulations are carried out to justify the expectations. Specifically, the network´s robustness to the selection of reconstruction parameters, the embedding time delay and dimension, is intuitively discussed and experimentally verified. In all cases, the proposed network was found to be a good solution for both prediction and synthesis
Keywords :
nonlinear dynamical systems; prediction theory; recurrent neural nets; speech enhancement; speech processing; speech synthesis; dimension; dynamical systems approach; embedding time delay; fully connected recurrent neural network; linear combiner; nonlinear predictive model; recurrent neural network speech predictor; robustness; speech analysis; speech synthesis; time delay reconstruction;
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
DOI :
10.1049/ip-vis:20000192