مرکز منطقه ای اطلاع رساني علوم و فناوري - KLT-based adaptive classified VQ of the speech signal

DocumentCode :

959846

Title :

KLT-based adaptive classified VQ of the speech signal

Author :

Kim, Moo Young ; Kleijn, W. Bastiaan

Author_Institution :

Dept. of Signals, Sensors & Syst., R. Inst. of Technol., Stockholm, Sweden

Volume :

Issue :

fYear :

2004

fDate :

5/1/2004 12:00:00 AM

Firstpage :

277

Lastpage :

289

Abstract :

Compared to scalar quantization (SQ), vector quantization (VQ) has memory, space-filling, and shape advantages. If the signal statistics are known, direct vector quantization (DVQ) according to these statistics provides the highest coding efficiency, but requires unmanageable storage requirements if the statistics are time varying. In code-excited linear predictive (CELP) coding, a single "compromise" codebook is trained in the excitation-domain and the space-filling and shape advantages of VQ are utilized in a nonoptimal, average sense. In this paper, we propose Karhunen-Loe`ve transform (KLT)-based adaptive classified VQ (CVQ), where the space-filling advantage can be utilized since the Voronoi-region shape is not affected by the KLT. The memory and shape advantages can be also used, since each codebook is designed based on a narrow class of KLT-domain statistics. We further improve basic KLT-CVQ with companding. The companding utilizes the shape advantage of VQ more efficiently. Our experiments show that KLT-CVQ provides a higher SNR than basic CELP coding, and has a computational complexity similar to DVQ and much lower than CELP. With companding, even single-class KLT-CVQ outperforms CELP, both in terms of SNR and codebook search complexity.

Keywords :

Karhunen-Loeve transforms; adaptive codes; computational complexity; linear predictive coding; speech coding; transform coding; vector quantisation; KLT-based adaptive classified VQ; Karhunen-Loeve transform; Voronei-region shape; code-excited linear predictive coding; compromise codebook; computational complexity; convergence; direct vector quantization; eigenvalue companding; memory requirement; scalar quantization; signal statistics; space-filling; speech coding; speech signal; Discrete transforms; Distortion; Histograms; Karhunen-Loeve transforms; Shape; Signal processing; Speech analysis; Speech coding; Statistics; Vector quantization;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2004.825661

Filename :

1288154

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=959846