A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo

Author

Derrien, Olivier ; Richard, Gaël

Author_Institution

ISITV, Univ. du Sud Toulon-Var, La Garde

Volume

16

Issue

8

fYear

2008

Firstpage

1373

Lastpage

1382

Abstract

In this paper, a new model-based algorithm for optimizing the MPEG-advanced audio coder (AAC) in MS-stereo mode is presented. This algorithm is an extension to stereo signals of prior work on a statistical model of quantization noise. Traditionally, MS-stereo coding approaches replace the left (l) and right (R) channels by the middle (M) and sides (S) channels, each channel being independently processed, almost like a monophonic signal. In contrast, our method proposes a global approach for coding both channels in the same process. A model for the quantization error allows us to tune the quantizers on channels M and S with respect to a distortion constraint on the reconstructed channels L and R as they will appear in the decoder. This approach leads to a more efficient perceptual noise-shaping and avoids using complex psychoacoustic models built on the M and S channels. Furthermore, it provides a straightforward scheme to choose between LR and MS modes in each subband for each frame. Subjective listening tests prove that the coding efficiency at a medium bitrate (96 kbits/s for both channels) is significantly better with our algorithm than with the standard algorithm, without increase of complexity.

Keywords

audio coding; optimisation; quantisation (signal); statistical analysis; MPEG-AAC; MS-stereo mode; advanced audio coder; complex psychoacoustic models; distortion constraint; left channels; middle and sides channels; model-based algorithm; monophonic signal; noise quantization; perceptual noise-shaping; reconstructed channels; right channels; statistical model; Bitrate constraint; MPEG-Advanced Audio Coder (AAC); MS-stereo; distortion constraint; optimization algorithm; perceptual audio coding; quantization; scalefactor; statistical model;

fLanguage

English

Journal_Title

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher

ieee

ISSN

1558-7916

Type

jour

DOI

10.1109/TASL.2008.2002068

Filename

4637898