DocumentCode
2160989
Title
Single transform perceptual audio encoder
Author
Kurniawati, E. ; Absar, J. ; George, S. ; Lau, C.T. ; Premkumar, B.
Author_Institution
SCE-Parallel Process. Lab, Nanyang Technol. Univ., Singapore, Singapore
Volume
2
fYear
2002
fDate
2002
Firstpage
599
Abstract
One of the most computationally intensive tasks in a perceptual audio encoder is the time-to-frequency transformation. The present state-of-the-art encoder, MPEG-AAC, uses a modified discrete cosine transform (MDCT) as its transform engine because of its favorable characteristics. Because AAC is a perceptual coder, another crucial module is the psychoacoustics module, in which the masking threshold is estimated by appropriately considering the effect of each of the masking components. An FFT is performed in this module to carry out the analysis. The presence of these two transforms has been accepted in the MPEG2/4-AAC standard. We explore the possibility of combining the two with the aim of reducing the complexity of the encoder. This technique will also benefit applications with low-delay requirements.
Keywords
audio coding; channel bank filters; code standards; discrete Fourier transforms; discrete cosine transforms; filtering theory; telecommunication standards; transform coding; DFT; FFT algorithm; MDCT; MPEG-AAC; MPEG2/4-AAC standard; filter banks; frequency transformation; low delay; masking components effect; masking threshold; modified discrete cosine transform; perceptual audio encoder; psychoacoustics module; transform engine; Discrete cosine transforms; Engines; Filter bank; Image analysis; Image reconstruction; Independent component analysis; Masking threshold; Microelectronics; Performance analysis; Psychoacoustics;
fLanguage
English
Publisher
IEEE
Conference_Titel
Digital Signal Processing, 2002. DSP 2002. 2002 14th International Conference on
Print_ISBN
0-7803-7503-3
Type
conf
DOI
10.1109/ICDSP.2002.1028161
Filename
1028161