DocumentCode :
3069994
Title :
Direct MDCT Domain Psychoacoustic Modeling
Author :
Suresh, K. ; Sreenivas, TV
Author_Institution :
Indian Inst. of Sci., Bangalore
fYear :
2007
fDate :
15-18 Dec. 2007
Firstpage :
742
Lastpage :
747
Abstract :
We extend the recently proposed spectral integration based psychoacoustic model for sinusoidal distortions to the MDCT domain. The estimated masking threshold additionally depends on the sub-band spectral flatness measure of the signal which accounts for the non- sinusoidal distortion introduced by masking. The expressions for masking threshold are derived and the validity of the proposed model is established through perceptual transparency test of audio clips. Test results indicate that we do achieve transparent quality reconstruction with the new model. Performance of the model is compared with MPEG psychoacoustic models with respect to the estimated perceptual entropy (PE). The results show that the proposed model predicts a lower PE than other models.
Keywords :
audio coding; data compression; discrete cosine transforms; distortion; entropy codes; signal reconstruction; spectral analysis; MPEG psychoacoustic model; digital audio compression; direct MDCT domain psychoacoustic modeling; masking threshold estimation; perceptual entropy estimation; quality reconstruction; sinusoidal distortion; spectral integration based psychoacoustic model; subband spectral flatness measure; Audio coding; Auditory system; Distortion; Frequency domain analysis; Humans; Masking threshold; Psychoacoustic models; Psychology; Signal processing; Transform coding; Psychoacoustics; audio coding; masking threshold;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2007 IEEE International Symposium on
Conference_Location :
Giza
Print_ISBN :
978-1-4244-1835-0
Electronic_ISBN :
978-1-4244-1835-0
Type :
conf
DOI :
10.1109/ISSPIT.2007.4458108
Filename :
4458108
Link To Document :
بازگشت