DocumentCode :
1364077
Title :
High-quality audio compression using an adaptive wavelet packet decomposition and psychoacoustic modeling
Author :
Srinivasan, Pramila ; Jamieson, Leah H.
Author_Institution :
Sch. of Electr. & Comput. Eng., Purdue Univ., West Lafayette, IN, USA
Volume :
46
Issue :
4
fYear :
1998
fDate :
4/1/1998 12:00:00 AM
Firstpage :
1085
Lastpage :
1093
Abstract :
This paper presents a technique to incorporate psychoacoustic models into an adaptive wavelet packet scheme to achieve perceptually transparent compression of high-quality (34.1 kHz) audio signals at about 45 kb/s. The filter bank structure adapts according to psychoacoustic criteria and according to the computational complexity that is available at the decoder. This permits software implementations that can perform according to the computational power available in order to achieve real time coding/decoding. The bit allocation scheme is an adapted zero-tree algorithm that also takes input from the psychoacoustic model. The measure of performance is a quantity called subband perceptual rate, which the filter bank structure adapts to approach the perceptual entropy (PE) as closely as possible. In addition, this method is also amenable to progressive transmission, that is, it can achieve the best quality of reconstruction possible considering the size of the bit stream available at the encoder. The result is a variable-rate compression scheme for high-quality audio that takes into account the allowed computational complexity, the available bit-budget, and the psychoacoustic criteria for transparent coding. This paper thus provides a novel scheme to marry the results in wavelet packets and perceptual coding to construct an algorithm that is well suited to high-quality audio transfer for Internet and storage applications
Keywords :
acoustic signal processing; adaptive signal processing; audio coding; audio signals; band-pass filters; computational complexity; data compression; decoding; entropy; filtering theory; signal reconstruction; transform coding; trees (mathematics); wavelet transforms; Internet applications; adapted zero-tree algorithm; adaptive wavelet packet decomposition; algorithm; available bit-budget; bit allocation; bit stream size; computational complexity; decoder; filter bank structure; high-quality audio compression; high-quality audio signals; high-quality audio transfer; perceptual coding; perceptual entropy; perceptually transparent compression; performance measure; progressive transmission; psychoacoustic criteria; psychoacoustic modeling; real time coding/decoding; reconstruction quality; software implementations; storage applications; subband perceptual rate; transparent coding; variable-rate compression; wavelet packets; Audio compression; Bit rate; Computational complexity; Decoding; Entropy; Filter bank; Psychoacoustic models; Psychology; Software performance; Wavelet packets;
fLanguage :
English
Journal_Title :
Signal Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1053-587X
Type :
jour
DOI :
10.1109/78.668558
Filename :
668558
Link To Document :
بازگشت