DocumentCode
3510200
Title
A modified distortion metric for audio coding
Author
Melkote, Vinay ; Rose, Kenneth
Author_Institution
Dept. of Electr. & Comput. Eng., Univ. of California, Santa Barbara, CA
fYear
2009
fDate
19-24 April 2009
Firstpage
17
Lastpage
20
Abstract
Current audio coding standards employ the modified discrete cosine transform (MDCT) where overlapped frames of audio are windowed and transformed to the frequency domain. Encoding parameters are chosen so as to minimize a distortion measure subject to a rate constraint. At the decoder, inverse transformation involves additional windowing and overlap-add of frames. An analysis of the time domain error in the reconstructed frame reveals that distortion metrics based solely on the MDCT domain error are in fact unable to capture the effects of windowing and overlap-add at the decoder. The main contribution of this paper is a modified distortion metric that does capture these effects via modified discrete sine transform analysis. When incorporated into an Advanced Audio Coder the proposed distortion metric significantly improves subjective quality of reconstructed audio.
Keywords
audio coding; code standards; discrete cosine transforms; distortion measurement; frequency-domain analysis; advanced audio coder; audio coding standards; audio overlapped frames; decoder; discrete sine transform analysis; distortion measure subject; frequency domain; inverse transformation; modified discrete cosine transform; modified distortion metric; reconstructed audio quality; Audio coding; Decoding; Discrete Fourier transforms; Discrete cosine transforms; Discrete transforms; Distortion measurement; Entropy; Frequency domain analysis; Quantization; Time domain analysis; audio coding; lapped transform; modified discrete sine transform; perceptual distortion;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4959509
Filename
4959509
Link To Document