DocumentCode :
1224375
Title :
On Integer MDCT for Perceptual Audio Coding
Author :
Li, Te ; Rahardja, Susanto ; Yu, Rongshan ; Koh, Soo Ngee
Author_Institution :
Inst. for Infocomm Res. (I2R), Singapore
Volume :
15
Issue :
8
fYear :
2007
Firstpage :
2236
Lastpage :
2248
Abstract :
In MPEG-4 scalable lossless coding (SLS) which was recently published as an ISO standard in June 2006, the integer modified discrete cosine transform (IntMDCT) was adopted to enable efficient lossless reconstruction. In addition, there is an MDCT filterbank which is inherent to the advanced audio coding (AAC) core that is present in the SLS codec. The presence of two filterbanks have undoubtedly increased the complexity of the implementation, and it is for this reason that the MDCT is disabled and the IntMDCT is then the only type of filterbank that is employed in SLS for both lossy and lossless operations. Because of the rounding operations in the IntMDCT, there is a concern if the use of IntMDCT for perceptual audio coding will eventually degrade the fidelity of the audio codec. This paper addresses this concern by analyzing the performance of the IntMDCT in a lossy coding scenario. It is found that noise introduced by the IntMDCT does not affect the perceptual quality of the coded audio under standard playback circumstances. As such, it concludes that the MDCT and IntMDCT filterbanks are interchangeable at lossy bitrate, and the way of using only the IntMDCT filterbank in scalable audio coding is also justified.
Keywords :
audio coding; channel bank filters; codecs; discrete cosine transforms; ISO standard; MPEG-4 scalable lossless coding; SLS codec; filterbanks; integer MDCT; modified discrete cosine transform; perceptual audio coding; standard playback circumstance; Audio coding; Codecs; Degradation; Discrete cosine transforms; Filter bank; ISO standards; Laser sintering; MPEG 4 Standard; Performance analysis; Performance loss; Integer modified discrete cosine transform (IntMDCT); perceptual audio coding; scalable lossless coding;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2007.905144
Filename :
4317569
Link To Document :
بازگشت