Title :
High-fidelity multichannel audio coding with Karhunen-Loeve transform
Author :
Yang, Dai ; Ai, Hongmei ; Kyriakakis, Chris ; Kuo, C. C Jay
Author_Institution :
NTT Cyber Space Lab., Tokyo, Japan
fDate :
7/1/2003 12:00:00 AM
Abstract :
A new quality-scalable high-fidelity multichannel audio compression algorithm based on MPEG-2 advanced audio coding (AAC) is presented. The Karhunen-Loeve transform (KLT) is applied to multichannel audio signals in the preprocessing stage to remove interchannel redundancy. Then, signals in decorrelated channels are compressed by a modified AAC main profile encoder. Finally, a channel transmission control mechanism is used to re-organize the bitstream so that the multichannel audio bitstream has a quality scalable property when it is transmitted over a heterogeneous network. Experimental results show that, compared with AAC, the proposed algorithm achieves a better performance while maintaining a similar computational complexity at the regular bit rate of 64 kbit/sec/ch. When the bitstream is transmitted to narrowband end users at a lower bit rate, packets in some channels can be dropped, and slightly degraded, yet full-channel, audio can still be reconstructed in a reasonable fashion without any additional computational cost.
Keywords :
Karhunen-Loeve transforms; audio coding; computational complexity; signal reconstruction; transform coding; 64 kbit/s; Karhunen-Loeve transform; MPEG-2 AAC; MPEG-2 advanced audio coding; audio bitstream; audio compression; audio signal reconstruction; computational complexity; decorrelated channels; heterogeneous network; high-fidelity audio coding; interchannel redundancy; multichannel audio coding; Audio coding; Audio compression; Bit rate; Computational complexity; Decorrelation; Degradation; Karhunen-Loeve transforms; Mechanical factors; Narrowband; Transform coding;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2003.814375