Title :
Modified MP3 encoder using complex modified cosine transform
Author :
Mathew, Manu ; Bha, Vasudha ; Thomas, Shine M. ; Yim, Changhoon
Author_Institution :
Software Platform Team, Samsung Electron., Suwon, South Korea
Abstract :
MPEG-1 layer-3, popularly known as MP3, has revolutionized the digital music domain. MP3 makes use of psychoacoustic modeling to achieve compression through the removal of perceptually irrelevant components of digital audio. The psychoacoustic model is the key element of perceptual coding and requires intensive FFT computation for calculating the frequency spectrum. This spectrum is used to compute masking thresholds. Thus, the original MP3 algorithm computes modified discrete cosine transform (MDCT) and FFT parallelly. The proposed algorithm is an alternative to this. We make use of complex modified discrete cosine transform (CMDCT) of the filter-bank outputs for generating MDCT coefficients as well as the frequency spectrum. This method requires fewer computations than the original method. A novel method of window switching, based on filter-bank output is used to simplify the overall algorithm. The proposed algorithm reduces the amount of computations for MP3 encoder while retaining the audio quality.
Keywords :
audio coding; discrete cosine transforms; fast Fourier transforms; filtering theory; FFT computation; MP3 encoder; cosine transform; digital audio; digital music; frequency spectrum; masking thresholds; modified discrete cosine transform; perceptual coding; psychoacoustic modeling; window switching; Acoustic noise; Codecs; Digital audio players; Discrete cosine transforms; Frequency; Masking threshold; Psychoacoustic models; Psychology; Quantization; Transform coding;
Conference_Titel :
Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
Print_ISBN :
0-7803-7965-9
DOI :
10.1109/ICME.2003.1221715