Title :
Multichannel object-based audio coding with controllable quality
Author :
Gorlow, Stanislaw ; Habets, Emanuel A. P. ; Marchand, Sylvain
Author_Institution :
LaBRI, Univ. Bordeaux, Talence, France
Abstract :
In this paper a new multichannel object-based audio coding scheme with scalable signal quality is proposed. The novel scheme is based on controlled downmixing and demixing. By means of a dedicated control mechanism, a number of distinct audio objects are mixed into a lower number of channels. The latter is chosen such that the desired quality level is met after demixing. The quality is assessed with two new psychoacoustically motivated metrics. Following the informed source separation approach, the downmix is decomposed via optimum spatial filtering guided by short-time power spectral densities of the audio objects. In an experiment it is shown that the raw data rate of an exemplary 10-track recording can be reduced by at least 30 % using linear pulse-code modulation while maintaining perceptual transparency.
Keywords :
audio coding; blind source separation; pulse code modulation; quality control; spatial filters; spectral analysis; controllable quality; controlled demixing; controlled downmixing; dedicated control mechanism; distinct audio objects; linear pulse-code modulation; multichannel object-based audio coding scheme; optimum spatial filtering; perceptual transparency; psychoacoustically motivated metrics; quality level; scalable signal quality; short-time power spectral density; source separation approach; Arrays; Covariance matrices; Decoding; Encoding; Measurement; Signal to noise ratio; Source separation; Audio coding; multichannel; object-based; quality control; spatial filtering;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6637710