DocumentCode :
763699
Title :
Parametric multichannel audio coding: synthesis of coherence cues
Author :
Faller, Christof
Author_Institution :
Audiovisual Commun. Lab., Lausanne, Switzerland
Volume :
14
Issue :
1
fYear :
2006
Firstpage :
299
Lastpage :
310
Abstract :
Parametric multichannel audio coding represents a multichannel audio signal as a single audio channel plus side information. The side information contains estimates of perceptually relevant differences between the original audio channels, usually time difference, level difference, and coherence cues. These cues determine, to a large degree, the auditory spatial image perceived when multichannel audio signals are played back. Level difference and time difference synthesis is simple: different gain factors and delays are applied to the sum signal in subbands to generate the decoder output channels. How to synthesize coherence cues is less obvious, and several heuristic methods for coherence synthesis have been proposed previously. In this paper, we propose a systematic approach to coherence synthesis in which the coherence measured in the encoder between a pair of channels is reproduced in the decoder. For that purpose, decorrelation filters that model late reverberation, with impulse responses several hundred milliseconds long, are used, which allows the scheme to generate natural-sounding diffuse sound. A method for reducing the computational complexity of the scheme is presented. The results of a subjective test indicate that the proposed scheme achieves good audio quality. Furthermore, the scheme was compared with a previous scheme without multichannel coherence synthesis and performed significantly better for all items tested.
Keywords :
audio coding; channel coding; computational complexity; decorrelation; filtering theory; audio signal; auditory spatial image; coherence cues; coherence synthesis; computational complexity reduction; decoder output channels; decorrelation filters; gain factors; level difference synthesis; naturally sounding diffuse sound; parametric multichannel audio coding synthesis; side information; time difference synthesis; Audio coding; Coherence; Computational complexity; Decoding; Delay effects; Filters; Reverberation; Signal generators; Signal synthesis; Testing; Auditory spatial image; diffuse sound; late reverberation; parametric multichannel audio coding; spatial perception; surround;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.854105
Filename :
1561286