Title of article :
Companded quantization of speech MDCT coefficients
Author/Authors :
F.، Norden, نويسنده , , P.، Hedelin, نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2005
Pages :
-162
From page :
163
To page :
0
Abstract :
Here, we propose speech-coding procedures achieving high subjective quality, avoiding speech-specific processing and interframe exploitation. Thus, the scheme is tractable for packet-based voice communication, and has the capability of coding generic audio. The architecture is based on an modified discrete cosine transform (MDCT) representation of the signal, and combines efficient vector quantization (VQ) techniques with psychoacoustic principles. Weighted quantization of MDCT coefficients is performed, using a codebook based on a statistical model of the multidimensional MDCT pdf. The weighting and the codebook are adapted for each frame to account for masking thresholds given by a psychoacoustic analysis. Actual quantization is performed using lattices, thereby, achieving close to rate independent complexity. The result is a coding scheme operational at a range of rates. Here, a particular instance at 16 kbits/s, using a sampling frequency of 8 kHz, is shown to perform better than an LD-CELP operating at the same rate, even though no interframe memory is exploited.
Keywords :
Power-aware
Journal title :
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Serial Year :
2005
Journal title :
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
Record number :
86852
Link To Document :
بازگشت