Title :
Non-linear short-term prediction in speech coding
Author :
Thyssen, Jes ; Nielsen, Henrik ; Hansen, Steffen Duus
Author_Institution :
Tele Danmark Res., Horsholm, Denmark
Abstract :
Addresses the question of how to extract the nonlinearities in speech with the prime purpose of facilitating coding of the residual signal in residual excited coders. The short-term prediction of speech in speech coders is extensively based on linear models, e.g. the linear predictive coding technique (LPC), which is one of the most basic elements in modern speech coders. This technique does not allow extraction of nonlinear dependencies. If nonlinearities are absent from speech the technique is sufficient, but if the speech contains nonlinearities the technique is inadequate. The authors give evidence for nonlinearities in speech and propose nonlinear short-term predictors that can substitute the LPC technique. The technique, called nonlinear predictive coding, is shown to be superior to the LPC technique. Two different nonlinear predictors are presented. The first is based on a second-order Volterra filter, and the second is based on a time delay neural network. The latter is shown to be the more suitable for speech coding applications
Keywords :
neural nets; nonlinear filters; prediction theory; speech coding; nonlinear predictive coding; nonlinear short-term prediction; nonlinearities; residual excited coders; residual signal; second-order Volterra filter; speech coding; time delay neural network; Data mining; Delay effects; Filters; Information analysis; Kernel; Linear predictive coding; Neural networks; Predictive coding; Predictive models; Speech coding;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389324