DocumentCode
3493752
Title
Parametric stereo extension of ITU-T G.722 based on a new downmixing scheme
Author
Hoang, Thi Minh Nguyet ; Ragot, Stephane ; Kövesi, Balazs ; Scalart, Pascal
Author_Institution
Orange Labs., TECH/OPERA/TPS, Lannion, France
fYear
2010
fDate
4-6 Oct. 2010
Firstpage
188
Lastpage
193
Abstract
In this paper, we present a novel frequency-domain stereo-to-mono downmixing technique that preserves the energy of spectral components and avoids setting the left or right channel as a phase reference. Based on this downmixing technique, a parametric stereo analysis-synthesis model is described in which the subband stereo parameters consist of interchannel level differences and phase differences between the mono signal and one of the stereo channels (left or right). This model is applied to the stereo extension of ITU-T G.722 at 56+8 and 64+16 kbit/s with a frame length of 5 ms. AB test results are provided to assess the quality of the proposed downmixing technique. In addition, the quality of the proposed G.722-based stereo coder is compared against reference coders (G.722.1 at 24 and 32 kbit/s dual mono and G.722 at 64 kbit/s dual mono) for clean speech, noisy speech, and music.
Keywords
audio coding; frequency-domain analysis; spectral analysis; ITU-T G.722; frequency-domain mono downmixing; frequency-domain stereo downmixing; mono signal; parametric stereo analysis-synthesis model; spectral component; stereo channel; stereo coder; subband stereo parameter; Bit rate; Decoding; Frequency domain analysis; MONOS devices; Speech; Speech coding;
fLanguage
English
Publisher
IEEE
Conference_Titel
Multimedia Signal Processing (MMSP), 2010 IEEE International Workshop on
Conference_Location
Saint Malo
Print_ISBN
978-1-4244-8110-1
Electronic_ISBN
978-1-4244-8111-8
Type
conf
DOI
10.1109/MMSP.2010.5662017
Filename
5662017