Title :
A low-complexity spectro-temporal based perceptual model
Author :
Taal, Cees ; Heusdens, Richard
Author_Institution :
Dept. of Mediamatics, Delft Univ. of Technol., Delft
Abstract :
The use of psychoacoustical masking models for audio coding applications has been wide spread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions that are introduced by the lossy coding method that is used. Up to now, these masking models are mostly based on spectral masking. In this paper, we propose a new perceptual model for audio and speech processing algorithms based on spectro-temporal masking. A sophisticated perceptual model is simplified, such that the eventual distortion measure can be written as a frequency-weighted l2-norm. This yields the same computational complexity as conventional spectral-based methods, but with the preservation of the temporal fine structure of the clean signal. It is shown that the new model can successfully avoid pre-echoes and can correctly predict masking curves for various maskers.
Keywords :
audio coding; audio coding applications; auditory masking; low-complexity spectro-temporal based perceptual model; psychoacoustical masking models; Additive noise; Audio coding; Computational complexity; Distortion measurement; Filters; Frequency measurement; Masking threshold; Psychoacoustic models; Quantization; Speech processing; audio coding; auditory masking; psychoacoustics;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959543