Title :
A perceptually based embedded subband speech coder
Author :
Tang, Benjamim ; Shen, Albert ; Alwan, Abeer ; Pottie, Gregory
Author_Institution :
TRW Inc., Redondo Beach, CA, USA
fDate :
3/1/1997 12:00:00 AM
Abstract :
A new scheme for robust, high-quality, embedded speech coding based on subband decomposition and perceptually optimized bit allocation and prioritization is presented. An infinite impulse response (IIR) quadrature mirror filterbank (QMF) performs subband decomposition. A perceptual model, computed using subband spectral analysis, optimizes the coder´s perceptual quality. Dynamic bit allocation and prioritization is combined with embedded quantization resulting in little performance degradation relative to a nonembedded implementation. The coder output is scalable from high quality at higher bit rates to lower quality at lower bit rates, supporting a wide range of service and resource utilization. The lower bit-rate representation is obtained simply through truncation of the higher bit-rate representation. Since source-rate adaptation is performed through truncation of the encoded stream, interaction with the coder is not required, making the embedded coder ideally suited for rate-adaptive communication systems. Performance for both speech and music was verified through subjective listening tests
Keywords :
IIR filters; adaptive systems; band-pass filters; filtering theory; music; quadrature mirror filters; quantisation (signal); spectral analysis; speech coding; speech intelligibility; speech processing; IIR quadrature mirror filterbank; dynamic bit prioritization; encoded stream truncation; higher bit-rate representation; infinite impulse response; lower bit-rate representation; music; perceptual model; perceptually based embedded subband speech coder; perceptually optimized bit allocation; performance; rate-adaptive communication systems; resource utilization; scalable coder output; service utilization; source-rate adaptation; subband decomposition; subband spectral analysis; subjective listening tests; Bit rate; Degradation; Filter bank; IIR filters; Mirrors; Quantization; Resource management; Robustness; Spectral analysis; Speech coding;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on