Title :
Direct MDCT Domain Psychoacoustic Modeling
Author :
Suresh, K. ; Sreenivas, TV
Author_Institution :
Indian Inst. of Sci., Bangalore
Abstract :
We extend the recently proposed spectral integration based psychoacoustic model for sinusoidal distortions to the MDCT domain. The estimated masking threshold additionally depends on the sub-band spectral flatness measure of the signal which accounts for the non- sinusoidal distortion introduced by masking. The expressions for masking threshold are derived and the validity of the proposed model is established through perceptual transparency test of audio clips. Test results indicate that we do achieve transparent quality reconstruction with the new model. Performance of the model is compared with MPEG psychoacoustic models with respect to the estimated perceptual entropy (PE). The results show that the proposed model predicts a lower PE than other models.
Keywords :
audio coding; data compression; discrete cosine transforms; distortion; entropy codes; signal reconstruction; spectral analysis; MPEG psychoacoustic model; digital audio compression; direct MDCT domain psychoacoustic modeling; masking threshold estimation; perceptual entropy estimation; quality reconstruction; sinusoidal distortion; spectral integration based psychoacoustic model; subband spectral flatness measure; Audio coding; Auditory system; Distortion; Frequency domain analysis; Humans; Masking threshold; Psychoacoustic models; Psychology; Signal processing; Transform coding; Psychoacoustics; audio coding; masking threshold;
Conference_Titel :
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location :
Giza
Print_ISBN :
978-1-4244-1835-0
Electronic_ISBN :
978-1-4244-1835-0
DOI :
10.1109/ISSPIT.2007.4458108