Title :
Neural code-excited linear prediction for low power speech compression
Author :
Kamarsu, SriGouri ; Card, H.C.
Author_Institution :
Dept. of Electr. & Comput. Eng., Manitoba Univ., Winnipeg, Man., Canada
Abstract :
In this paper, we discuss the use of artificial neural learning methods for low bit-rate speech compression, potentially in non-stationary environments. Unsupervised learning algorithms are particularly well-suited for vector quantization (VQ) which is used in many speech compression applications. We discuss two unsupervised learning algorithms: frequency-sensitive competitive learning and Kohonen´s self-organizing maps which have both been investigated for learning the codebook vectors in an adaptive vector quantizer. In contrast with earlier work, we have employed these learning rules in VQ of the linear predictive coding (LPC) prediction residual. The performance of these unsupervised learning algorithms in speaker-dependent and speaker-independent speech compression are presented. Our results compare favourably with those of code-excited linear prediction (CELP) requiring reduced computational power with a tolerable reduction in speech quality. We also explore the effects of limited precision on classification and learning in competitive learning algorithms for low power VLSI implementations
Keywords :
VLSI; linear predictive coding; neural nets; pattern classification; self-organising feature maps; speech coding; unsupervised learning; vector quantisation; Kohonen´s self-organizing maps; adaptive vector quantizer; artificial neural learning methods; classification; codebook vectors; competitive learning algorithms; computational power; frequency-sensitive competitive learning; learning rules; linear predictive coding prediction residual; low power VLSI implementations; low power speech compression; neural code-excited linear prediction; nonstationary environments; speaker-dependent speech compression; speaker-independent speech compression; speech quality; unsupervised learning algorithms; vector quantization; Artificial neural networks; Frequency; Learning systems; Linear predictive coding; Signal processing algorithms; Speech coding; Speech processing; Telephony; Unsupervised learning; Vector quantization;
Conference_Titel :
WESCANEX 95. Communications, Power, and Computing. Conference Proceedings., IEEE
Conference_Location :
Winnipeg, Man.
Print_ISBN :
0-7803-2725-X
DOI :
10.1109/WESCAN.1995.494066