• DocumentCode
    3510892
  • Title

    A low-complexity spectro-temporal based perceptual model

  • Author

    Taal, Cees ; Heusdens, Richard

  • Author_Institution
    Dept. of Mediamatics, Delft Univ. of Technol., Delft
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    153
  • Lastpage
    156
  • Abstract
    The use of psychoacoustical masking models for audio coding applications has been widespread over the past decades. In such applications, it is typically assumed that the original input signal serves as a masker for the distortions introduced by the lossy coding method. Up to now, these masking models have mostly been based on spectral masking. In this paper, we propose a new perceptual model for audio and speech processing algorithms based on spectro-temporal masking. A sophisticated perceptual model is simplified such that the resulting distortion measure can be written as a frequency-weighted l2-norm. This yields the same computational complexity as conventional spectral-based methods while preserving the temporal fine structure of the clean signal. It is shown that the new model can successfully avoid pre-echoes and can correctly predict masking curves for various maskers.
  • Keywords
    audio coding; audio coding applications; auditory masking; low-complexity spectro-temporal based perceptual model; psychoacoustical masking models; Additive noise; Computational complexity; Distortion measurement; Filters; Frequency measurement; Masking threshold; Psychoacoustic models; Quantization; Speech processing; psychoacoustics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959543
  • Filename
    4959543