Title :
Temporal quantization of spatial information using directional clustering for multichannel audio coding
Author :
Miyabe, Shigeki ; Masatoki, Keisuke ; Saruwatari, Hiroshi ; Shikano, Kiyohiro ; Nomura, Toshiyuki
Author_Institution :
Univ. of Tokyo, Tokyo, Japan
Abstract :
Binaural cue coding (BCC), a representative low-bit-rate coding method for multichannel audio, generates large distortion when the audio data have a complex spatial image, as in symphonic music. This distortion is caused by the low frequency resolution of the spatial information, because BCC quantizes the localization parameters. In this paper, we propose a new coding framework that quantizes the spatial information temporally. The single-channel sum signal is panned to the multiple channels by selecting among prototypes of the spatial filter. The prototypes that minimize the coding error are obtained by a k-means-like clustering of the angles, whose centroids are given by the first principal components of the covariances in the classes. Objective and subjective evaluations verify that the proposed coding is efficient while maintaining high quality.
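The clustering step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `directional_clustering`, the real-valued frame matrix `X`, the squared-projection assignment rule, and the random initialization are all assumptions made for illustration. Each frame is assigned to the unit-norm prototype direction onto which it projects most strongly, and each prototype is then replaced by the first principal component of its class covariance.

```python
import numpy as np

def directional_clustering(X, k, n_iter=20, seed=0):
    """Hypothetical sketch of a k-means-like clustering of directions.

    X : (n_frames, n_channels) real array of nonzero multichannel frames.
    Returns (C, labels): unit-norm prototype directions and class indices.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    # Assumption: initialise prototypes from randomly chosen, normalised frames.
    idx = rng.choice(len(X), size=k, replace=False)
    C = X[idx] / np.linalg.norm(X[idx], axis=1, keepdims=True)

    for _ in range(n_iter):
        # Assignment: pick the direction with the largest squared projection,
        # which is the direction with the smallest projection residual.
        labels = np.argmax((X @ C.T) ** 2, axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) == 0:
                continue  # keep the old prototype for an empty class
            # Centroid update: first principal component of the class covariance.
            cov = members.T @ members
            _, vecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
            C[j] = vecs[:, -1]

    labels = np.argmax((X @ C.T) ** 2, axis=1)
    return C, labels
```

The sign of each prototype is arbitrary in this sketch, since a direction and its negation span the same line; a real codec would also need to handle complex-valued subband signals, which are not covered here.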
Keywords :
audio coding; audio data; binaural cue coding; coding framework; directional clustering; low bit rate coding; minimum coding error; multichannel audio coding; single-channel sum signal; spatial filter; spatial information; temporal quantization; Acoustic distortion; Audio coding; Filter bank; Frequency; Image coding; Prototypes; Quantization; Signal generators; Signal resolution; Spatial resolution; Multi-channel audio coding; binaural cue coding; k-means clustering; vector quantization;
Conference_Title :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
DOI :
10.1109/ASPAA.2009.5346519