Title :
Optimal Bit Layering for Scalable Audio Compression Using Objective Audio Quality Metrics
Author :
Kandadai, Srivatsan ; Creusere, Charles D.
Author_Institution :
New Mexico State Univ., Las Cruces
Abstract :
Perceptual audio compression uses auditory masking to hide coding distortion. The masking thresholds are obtained from mathematical models of the human ear. At low bitrates, however, coding noise is significant and cannot be masked by the audio content. A perceptually scalable audio compression system, even at low bitrates, should generate a bitstream with layers of audio fidelity such that each layer improves the quality of the reconstructed audio by an amount that is just noticeable to the listener. In this paper, we describe a low-bitrate (8-64 kbps) scalable audio compression system that uses a residual weighted vector quantization (VQ) algorithm to generate a scalable bitstream. To make this bitstream perceptually scalable, we layer the residual indices generated by the coder using objective audio quality metrics developed for evaluating highly impaired audio signals.
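The layering idea in the abstract can be illustrated with a small sketch. It is not the authors' algorithm: the greedy ordering, the toy codebooks, and the band-weighted squared error used in place of an objective audio quality metric are all assumptions made for illustration, as are the function names. Each band is quantized by a multistage residual VQ, and the stage indices are then placed in the bitstream in the order that lowers the distortion measure the most at each step.

```python
def nearest(codebook, x):
    """Return the index of the codeword closest to x in squared error."""
    return min(range(len(codebook)),
               key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], x)))


def rvq_encode(x, codebooks):
    """Multistage residual VQ: each stage quantizes what earlier stages left over."""
    residual = list(x)
    indices = []
    for cb in codebooks:
        i = nearest(cb, residual)
        indices.append(i)
        residual = [r - c for r, c in zip(residual, cb[i])]
    return indices


def rvq_decode(indices, codebooks, dim):
    """Sum the codewords of however many stages have been received."""
    out = [0.0] * dim
    for i, cb in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, cb[i])]
    return out


def distortion(bands, recon, weights):
    """Assumed stand-in for an objective quality metric (lower is better)."""
    return sum(w * sum((a - b) ** 2 for a, b in zip(x, y))
               for w, x, y in zip(weights, bands, recon))


def greedy_layering(bands, codebooks, weights):
    """Order (band, stage) indices so each layer gives the largest distortion drop."""
    indices = [rvq_encode(x, codebooks) for x in bands]
    sent = [0] * len(bands)  # number of stages already layered, per band
    total = sum(len(ix) for ix in indices)
    order = []
    while len(order) < total:
        best = None
        for b in range(len(bands)):
            if sent[b] == len(indices[b]):
                continue  # this band has no stages left to send
            trial = sent.copy()
            trial[b] += 1
            recon = [rvq_decode(indices[k][:trial[k]], codebooks, len(bands[k]))
                     for k in range(len(bands))]
            d = distortion(bands, recon, weights)
            if best is None or d < best[0]:
                best = (d, b)
        d, b = best
        order.append((b, sent[b], d))  # (band, stage, distortion after this layer)
        sent[b] += 1
    return order


# Hypothetical toy data: two bands, two VQ stages shared by both bands.
codebooks = [
    [[0.0, 0.0], [4.0, 4.0], [-4.0, -4.0]],   # coarse stage
    [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]],   # refinement stage
]
bands = [[5.0, 5.0], [1.0, 1.0]]
order = greedy_layering(bands, codebooks, weights=[1.0, 1.0])
for band, stage, d in order:
    print(f"band {band}, stage {stage}: distortion {d}")
```

Truncating the resulting order at any point yields a decodable lower-rate stream. In a real system the `distortion` function would be replaced by the chosen objective quality metric, and the weights would come from a perceptual model rather than being fixed by hand.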
Keywords :
audio coding; data compression; distortion; audio content; auditory masking thresholds; coding distortion hiding; coding noise; human ear; mathematical models; objective audio quality metrics; optimal bit layering; perceptual audio compression; scalable audio compression system; Audio compression; Bit rate; Ear; Humans; Linear predictive coding; Masking threshold; Mathematical model; Noise shaping; Quantization; Signal generators;
Conference_Title :
Signals, Systems and Computers, 2007. ACSSC 2007. Conference Record of the Forty-First Asilomar Conference on
Conference_Location :
Pacific Grove, CA
Print_ISBN :
978-1-4244-2109-1
Electronic_ISBN :
1058-6393
DOI :
10.1109/ACSSC.2007.4487275