Title :
A General Compression Approach to Multi-Channel Three-Dimensional Audio
Author :
Cheng, Bin ; Ritz, Christian ; Burnett, Ian ; Zheng, Xiguang
Author_Institution :
Sch. of Electr. Comput. & Telecommun. Eng., Univ. of Wollongong, Wollongong, NSW, Australia
Abstract :
This paper presents a technique for low bit rate compression of three-dimensional (3D) audio produced by multiple loudspeaker channels. The approach is based on time-frequency analysis of the localization of spatial sound sources within the 3D space rendered by a multi-channel audio signal (in this case, 16 channels). From this analysis, a stereo downmix signal representing the original 16 channels is derived. Alternatively, a mono downmix signal can be derived, with side information representing the location of sound sources within the 3D spatial scene. The downmix signals are then compressed with a traditional audio coder, giving a representation of the 3D soundfield at bit rates comparable with existing stereo audio coders while maintaining the perceptual quality obtained by encoding each channel separately.
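The idea of a mono downmix accompanied by location side information can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a simple passive (summing) downmix and a generic energy-weighted direction estimate for a single time-frequency tile. The names `unit_vector` and `analyse_tile`, the loudspeaker layout, and the azimuth/elevation convention are all hypothetical choices made for illustration.

```python
import math


def unit_vector(azimuth_deg, elevation_deg):
    """Cartesian unit vector for a direction given in degrees (assumed convention)."""
    az = math.radians(azimuth_deg)
    el = math.radians(elevation_deg)
    return (math.cos(el) * math.cos(az),
            math.cos(el) * math.sin(az),
            math.sin(el))


def analyse_tile(tile, speakers):
    """Analyse one time-frequency tile of a multi-channel signal.

    tile     : list of complex coefficients, one per loudspeaker channel
    speakers : list of (azimuth_deg, elevation_deg), one per channel
    Returns (mono, azimuth_deg, elevation_deg): a passive mono downmix
    plus an energy-weighted direction estimate used as side information.
    """
    mono = sum(tile)  # passive downmix: plain sum of the channels (an assumption)
    energies = [abs(c) ** 2 for c in tile]
    total = sum(energies)
    if total == 0.0:
        return mono, 0.0, 0.0  # silent tile: direction is arbitrary

    # Energy-weighted average of the loudspeaker direction vectors.
    x = y = z = 0.0
    for energy, (az, el) in zip(energies, speakers):
        ux, uy, uz = unit_vector(az, el)
        x += energy * ux
        y += energy * uy
        z += energy * uz
    x, y, z = x / total, y / total, z / total

    azimuth = math.degrees(math.atan2(y, x))
    elevation = math.degrees(math.atan2(z, math.hypot(x, y)))
    return mono, azimuth, elevation


# Hypothetical three-loudspeaker layout: front, left, and overhead.
layout = [(0.0, 0.0), (90.0, 0.0), (0.0, 90.0)]
mono, az, el = analyse_tile([1 + 0j, 1 + 0j, 0j], layout)
print(mono, round(az, 3), round(el, 3))
```

In a scheme of this kind, only the mono coefficient and the two angles would be passed on per tile, with the mono signal going to a conventional audio coder. How the paper actually derives, quantises, and transmits its side information is not stated in the abstract.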
Keywords :
audio coding; channel coding; data compression; time-frequency analysis; 3D audio low bit rate compression technique; 3D sound field; 3D space; 3D spatial scene; channel encoding; general compression approach; mono-downmix signal; multichannel audio signal; multichannel three-dimensional audio; multiple loudspeaker channels; side information; spatial sound source localization; stereo audio coders; stereo downmix signal; 3D audio
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2013.2260156