DocumentCode :
959846
Title :
KLT-based adaptive classified VQ of the speech signal
Author :
Kim, Moo Young ; Kleijn, W. Bastiaan
Author_Institution :
Dept. of Signals, Sensors & Syst., R. Inst. of Technol., Stockholm, Sweden
Volume :
12
Issue :
3
fYear :
2004
fDate :
5/1/2004 12:00:00 AM
Firstpage :
277
Lastpage :
289
Abstract :
Compared to scalar quantization (SQ), vector quantization (VQ) has memory, space-filling, and shape advantages. If the signal statistics are known, direct vector quantization (DVQ) according to these statistics provides the highest coding efficiency, but requires unmanageable storage requirements if the statistics are time varying. In code-excited linear predictive (CELP) coding, a single "compromise" codebook is trained in the excitation-domain and the space-filling and shape advantages of VQ are utilized in a nonoptimal, average sense. In this paper, we propose Karhunen-Loe`ve transform (KLT)-based adaptive classified VQ (CVQ), where the space-filling advantage can be utilized since the Voronoi-region shape is not affected by the KLT. The memory and shape advantages can be also used, since each codebook is designed based on a narrow class of KLT-domain statistics. We further improve basic KLT-CVQ with companding. The companding utilizes the shape advantage of VQ more efficiently. Our experiments show that KLT-CVQ provides a higher SNR than basic CELP coding, and has a computational complexity similar to DVQ and much lower than CELP. With companding, even single-class KLT-CVQ outperforms CELP, both in terms of SNR and codebook search complexity.
Keywords :
Karhunen-Loeve transforms; adaptive codes; computational complexity; linear predictive coding; speech coding; transform coding; vector quantisation; KLT-based adaptive classified VQ; Karhunen-Loeve transform; Voronei-region shape; code-excited linear predictive coding; compromise codebook; computational complexity; convergence; direct vector quantization; eigenvalue companding; memory requirement; scalar quantization; signal statistics; space-filling; speech coding; speech signal; Discrete transforms; Distortion; Histograms; Karhunen-Loeve transforms; Shape; Signal processing; Speech analysis; Speech coding; Statistics; Vector quantization;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.825661
Filename :
1288154
Link To Document :
بازگشت