DocumentCode :
542652
Title :
Psycho-acoustic modeling of audio with exponentially damped sinusoids
Author :
Hermus, Kris ; Verhelst, Werner ; Wambacq, Patrick
Author_Institution :
Lab. of Processing Speech and Images (PSI), Dept. of Electrical Engineering - ESAT, Katholieke Universiteit Leuven, Belgium
Volume :
2
fYear :
2002
fDate :
13-17 May 2002
Abstract :
While a traditional sinusoidal model is capable of representing audio segments, a sum of exponentially damped sinusoids is more efficient to model the transient segments that are readily found in audio signals. In this paper, Total Least Squares (TLS) algorithms are applied to automatically extract the modeling parameters in the Exponential Sinusoidal Model (ESM). In order to turn the SNR . optimization criterion of these TLS algorithms into a perceptual modeling strategy we incorporate the psycho-acoustic model of MPEG 1 - Layer 1 into a subband TLS-ESM scheme. This allows us to model each subband in accordance with its perceptual relevance. Informal listening tests confirm that perceptual ESM achieves the same perceived quality as plain ESM while using substantially less components.
Keywords :
Lead; Signal to noise ratio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5744978
Filename :
5744978
Link To Document :
بازگشت