DocumentCode :
1239799
Title :
Perceptual segmentation and component selection for sinusoidal representations of audio
Author :
Painter, Ted ; Spanias, Andreas
Author_Institution :
Handheld Comput. Div., Intel Corp., Hudson, MA, USA
Volume :
13
Issue :
2
fYear :
2005
fDate :
3/1/2005 12:00:00 AM
Firstpage :
149
Lastpage :
162
Abstract :
This paper presents two fundamental enhancements in a hybrid audio signal model consisting of sinusoidal, transient, and noise (STN) components. The first enhancement involves a novel application of a perceptual metric for optimal time segmentation for the analysis of transients. In particular, Moore and Glasberg´s model of partial loudness is modified for use with general signals and then integrated into a novel time segmentation scheme. The second, and perhaps more significant STN enhancement is concerned with a new methodology for ranking and selection of the most perceptually relevant sinusoids. A systematic procedure is developed for the selection of a compact set of sinusoids and comparative results are given to demonstrate the merit of this method.
Keywords :
audio coding; channel bank filters; loudness; noise; transient analysis; Glasberg model; Moore model; hybrid audio signal model; optimal time segmentation; partial loudness; perceptual segmentation; sinusoidal representation; time segmentation scheme; Audio coding; Filter bank; Frequency estimation; Psychoacoustic models; Signal analysis; Signal processing; Signal synthesis; Speech coding; Steady-state; Transient analysis; Audio coding; psychoacoustics; segmentation; sinusoidal models;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.841050
Filename :
1395960
Link To Document :
بازگشت