DocumentCode
1184217
Title
Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding
Author
Wu, Lizhong ; Niranjan, Mahesan ; Fallside, Frank
Author_Institution
Dept. of Comput. Sci. & Eng., Oregon Graduate Inst., Portland, OR, USA
Volume
2
Issue
4
fYear
1994
fDate
10/1/1994 12:00:00 AM
Firstpage
482
Lastpage
489
Abstract
Recent studies have shown that nonlinear predictors can achieve about 2-3 dB improvement in speech prediction over conventional linear predictors. In this paper, we exploit the advantage of the nonlinear prediction capability of neural networks and apply it to the design of improved predictive speech coders. Our studies concentrate on the following three aspects: (a) the development of short-term (formant) and long-term (pitch) nonlinear predictive vector quantizers (b) the analysis of the output variance of the nonlinear predictive filter with respect to the input disturbance (c) the design of nonlinear predictive speech coders. The above studies have resulted in a fully vector-quantized, code-excited, nonlinear predictive speech coder. Performance evaluations and comparisons with linear predictive speech coding are presented. These tests have shown the applicability of nonlinear prediction in speech coding and the improvement in coding performance
Keywords
linear predictive coding; neural nets; speech coding; vector quantisation; code-excited nonlinear predictive speech coding; coding performance; formant VQ; input disturbance; long-term VQ; neural networks; nonlinear predictive filter; output variance; performance evaluations; pitch VQ; predictive speech coders; short-term VQ; speech prediction; vector quantizers; Analysis of variance; Equations; Filters; Neural networks; Predictive models; Recurrent neural networks; Speech analysis; Speech coding; Testing; Vectors;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.326608
Filename
326608
Link To Document