• DocumentCode
    880389
  • Title
    A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo
  • Author
    Derrien, Olivier; Richard, Gaël
  • Author_Institution
    ISITV, Univ. du Sud Toulon-Var, La Garde
  • Volume
    16
  • Issue
    8
  • fYear
    2008
  • Firstpage
    1373
  • Lastpage
    1382
  • Abstract
    In this paper, a new model-based algorithm for optimizing the MPEG-advanced audio coder (AAC) in MS-stereo mode is presented. This algorithm extends prior work on a statistical model of quantization noise to stereo signals. Traditionally, MS-stereo coding approaches replace the left (L) and right (R) channels by the middle (M) and side (S) channels, each channel being processed independently, almost like a monophonic signal. In contrast, our method takes a global approach that codes both channels in the same process. A model for the quantization error allows us to tune the quantizers on channels M and S with respect to a distortion constraint on the reconstructed channels L and R as they will appear in the decoder. This approach leads to more efficient perceptual noise shaping and avoids using complex psychoacoustic models built on the M and S channels. Furthermore, it provides a straightforward scheme to choose between LR and MS modes in each subband for each frame. Subjective listening tests show that the coding efficiency at a medium bitrate (96 kbit/s for both channels) is significantly better with our algorithm than with the standard algorithm, without any increase in complexity.
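    The relation the abstract relies on, namely that quantization noise introduced on M and S reappears on the decoded L and R, can be illustrated with a minimal sketch. It assumes the common definition M = (L + R) / 2 and S = (L - R) / 2, and it substitutes a plain uniform quantizer for the AAC quantizer; the function names and step sizes are hypothetical and this is not the authors' algorithm.

```python
import random


def lr_to_ms(left, right):
    """Forward transform, assuming M = (L + R) / 2 and S = (L - R) / 2."""
    mid = [(l + r) / 2 for l, r in zip(left, right)]
    side = [(l - r) / 2 for l, r in zip(left, right)]
    return mid, side


def ms_to_lr(mid, side):
    """Inverse transform as seen by the decoder: L = M + S, R = M - S."""
    left = [m + s for m, s in zip(mid, side)]
    right = [m - s for m, s in zip(mid, side)]
    return left, right


def quantize(values, step):
    """Uniform quantizer (a stand-in; the AAC quantizer is non-uniform)."""
    return [step * round(v / step) for v in values]


def mean_square(values):
    return sum(v * v for v in values) / len(values)


def reconstructed_error_power(left, right, step_m, step_s):
    """Quantize M and S, decode, and measure the error power on L and R."""
    mid, side = lr_to_ms(left, right)
    left_hat, right_hat = ms_to_lr(quantize(mid, step_m), quantize(side, step_s))
    err_l = mean_square([a - b for a, b in zip(left, left_hat)])
    err_r = mean_square([a - b for a, b in zip(right, right_hat)])
    return err_l, err_r


if __name__ == "__main__":
    random.seed(0)
    # Hypothetical, strongly correlated stereo pair.
    left = [random.uniform(-1.0, 1.0) for _ in range(1000)]
    right = [0.9 * l + 0.1 * random.uniform(-1.0, 1.0) for l in left]
    print(reconstructed_error_power(left, right, step_m=0.05, step_s=0.01))
```

    Because L = M + S and R = M - S, the error on each decoded channel is the sum or difference of the M and S quantization errors, which is why a distortion constraint stated on L and R couples the two quantizer settings.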
  • Keywords
    audio coding; optimisation; quantisation (signal); statistical analysis; MPEG-AAC; MS-stereo mode; advanced audio coder; complex psychoacoustic models; distortion constraint; left channels; middle and sides channels; model-based algorithm; monophonic signal; noise quantization; perceptual noise-shaping; reconstructed channels; right channels; statistical model; Bitrate constraint; MPEG-Advanced Audio Coder (AAC); MS-stereo; optimization algorithm; perceptual audio coding; quantization; scalefactor
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1558-7916
  • Type
    jour
  • DOI
    10.1109/TASL.2008.2002068
  • Filename
    4637898