DocumentCode :
754316
Title :
Scalable Audio Compression at Low Bitrates
Author :
Kandadai, Srivatsan ; Creusere, Charles D.
Author_Institution :
Klipsch Sch. of Electr. & Comput. Eng., New Mexico State Univ., Las Cruces, NM
Volume :
16
Issue :
5
fYear :
2008
fDate :
7/1/2008 12:00:00 AM
Firstpage :
969
Lastpage :
979
Abstract :
A perceptually scalable audio coder generates a bit-stream that contains layers of audio fidelity and is encoded in such a way that adding one of these layers enhances the reconstructed audio by an amount that is just noticeable by the listener. Such algorithms have applications like music on demand at variable levels of fidelity, for instance using 3G and 4G cellular radio systems operating at different bit rates. While the MPEG-4 natural audio coder can create finely scalable bit streams using bit sliced arithmetic coding (BSAC), its perceptual quality at low bit rates is poor. On the other hand, the nonscalable transform-domain weighted interleaved vector quantization (TWIN-VQ) performs well at low bit rates. In this paper, we present a modified version of TWIN-VQ algorithm that generates a perceptually scalable bit-stream with many fine layers of audio fidelity. Using TWIN-VQ as our base ensures the best possible perceptual quality at low bit rates. Specifically, the proposed scalable algorithm performs as well as TWIN-VQ at rates of 8 to 16 kb/s and outperforms scalable BSAC by between 64% and 172% at rates of less than 24 kb/s.
Keywords :
3G mobile communication; 4G mobile communication; audio coding; data compression; vector quantisation; 3G cellular radio systems; 4G cellular radio systems; MPEG-4 natural audio coder; audio fidelity; bit sliced arithmetic coding; interleaved vector quantization; low bitrates; scalable audio compression; scalable bit streams; Arithmetic; Audio coding; Audio compression; Bit rate; Channel capacity; Land mobile radio cellular systems; MPEG 4 Standard; Scalability; Streaming media; Vector quantization; Objective audio quality metrics; perceptual coding; scalability; transform-domain weighted interleaved vector quantization (TWIN-VQ); vector quantization;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.925881
Filename :
4544823
Link To Document :
بازگشت