DocumentCode
3510892
Title
A low-complexity spectro-temporal based perceptual model
Author
Taal, Cees ; Heusdens, Richard
Author_Institution
Dept. of Mediamatics, Delft Univ. of Technol., Delft
fYear
2009
fDate
19-24 April 2009
Firstpage
153
Lastpage
156
Abstract
The use of psychoacoustical masking models for audio coding applications has been widespread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions introduced by the lossy coding method. Up to now, these masking models have mostly been based on spectral masking. In this paper, we propose a new perceptual model for audio and speech processing algorithms based on spectro-temporal masking. A sophisticated perceptual model is simplified such that the resulting distortion measure can be written as a frequency-weighted l2-norm. This yields the same computational complexity as conventional spectral-based methods while preserving the temporal fine structure of the clean signal. It is shown that the new model can successfully avoid pre-echoes and can correctly predict masking curves for various maskers.
Keywords
audio coding; audio coding applications; auditory masking; low-complexity spectro-temporal based perceptual model; psychoacoustical masking models; Additive noise; Computational complexity; Distortion measurement; Filters; Frequency measurement; Masking threshold; Psychoacoustic models; Quantization; Speech processing; psychoacoustics
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4959543
Filename
4959543