Title :
Psycho-acoustic modeling of audio with exponentially damped sinusoids
Author :
Hermus, Kris ; Verhelst, Werner ; Wambacq, Patrick
Author_Institution :
Lab. of Processing Speech and Images (PSI), Dept. of Electrical Engineering - ESAT, Katholieke Universiteit Leuven, Belgium
Abstract :
While a traditional sinusoidal model is capable of representing audio segments, a sum of exponentially damped sinusoids is more efficient to model the transient segments that are readily found in audio signals. In this paper, Total Least Squares (TLS) algorithms are applied to automatically extract the modeling parameters in the Exponential Sinusoidal Model (ESM). In order to turn the SNR . optimization criterion of these TLS algorithms into a perceptual modeling strategy we incorporate the psycho-acoustic model of MPEG 1 - Layer 1 into a subband TLS-ESM scheme. This allows us to model each subband in accordance with its perceptual relevance. Informal listening tests confirm that perceptual ESM achieves the same perceived quality as plain ESM while using substantially less components.
Keywords :
Lead; Signal to noise ratio;
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.2002.5744978