• DocumentCode
    3493752
  • Title

    Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme

  • Author

    Hoang, Thi Minh Nguyet ; Ragot, Stephane ; Vesi, Balazs KÖ ; Scalart, Pascal

  • Author_Institution
    Orange Labs., TECH/OPERA/TPS, Lannion, France
  • fYear
    2010
  • fDate
    4-6 Oct. 2010
  • Firstpage
    188
  • Lastpage
    193
  • Abstract
    In this paper, we present a novel, frequency-domain stereo to mono downmixing, which preserves the energy of spectral components and avoids setting the left or right channel as a phase reference. Based on this downmixing technique, a parametric stereo analysis-synthesis model is described in which subband stereo parameters consist of interchannel level differences and phase differences between the mono signal and one of the stereo channels (left or right). This model is applied to the stereo extension of ITU-T G.722 at 56+8 and 64+16 kbit/s with a frame length of 5 ms. AB test results are provided to assess the quality of the proposed downmixing technique. In addition, the quality of the proposed G.722-based stereo coder is compared against reference coders (G.722.1 at 24 and 32 kbit/s dual mono and G.722 at 64 kbit/s dual mono) for clean speech, noisy speech and music.
  • Keywords
    audio coding; frequency-domain analysis; spectral analysis; ITU-T G.722; frequency-domain mono downmixing; frequency-domain stereo downmixing; mono signal; parametric stereo analysis-synthesis model; spectral component; stereo channel; stereo coder; subband stereo parameter; Bit rate; Decoding; Frequency domain analysis; MONOS devices; Speech; Speech coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
  • Conference_Location
    Saint Malo
  • Print_ISBN
    978-1-4244-8110-1
  • Electronic_ISBN
    978-1-4244-8111-8
  • Type

    conf

  • DOI
    10.1109/MMSP.2010.5662017
  • Filename
    5662017