DocumentCode
1973636
Title
A psychoacoustic model for audio coding based on a cochlear filter bank
Author
Baumgarte, Frank
Author_Institution
Media Signal Process. Res., Agere Syst., Murray Hill, NJ, USA
fYear
2001
fDate
2001
Firstpage
139
Lastpage
142
Abstract
Perceptual audio coders use an estimated masked threshold to determine the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model that mimics the psychoacoustics of masking. Current applications use a uniform spectral decomposition as the first stage of that model to approximate the frequency selectivity of the human auditory system. The availability of efficient implementations has led to a virtually exclusive use of uniform decompositions in audio coding. However, the equal filter properties of the uniform sub-bands are not in line with the nonuniform auditory filters. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank with simplified, less complex post-processing for estimating the masked threshold. Application results in audio coding show that the new model performs significantly better in terms of bit rate and/or quality than other state-of-the-art models that use a uniform spectral decomposition.
Keywords
audio coding; channel bank filters; hearing; parameter estimation; quantisation (signal); spectral analysis; cochlear filter bank; human auditory system; masked threshold; maximum permissible just-inaudible noise level; perceptual audio coders; psychoacoustic model; quantization; spectral decomposition; Audio coding; Auditory system; Bit rate; Filter bank; Frequency; Humans; Noise level; Psychoacoustic models; Psychology; Quantization
fLanguage
English
Publisher
IEEE
Conference_Titel
Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the
Conference_Location
New Paltz, NY
Print_ISBN
0-7803-7126-7
Type
conf
DOI
10.1109/ASPAA.2001.969562
Filename
969562