DocumentCode
788503
Title
A Dynamic Compressive Gammachirp Auditory Filterbank
Author
Irino, Toshio ; Patterson, Roy D.
Author_Institution
Fac. of Syst. Eng., Wakayama Univ.
Volume
14
Issue
6
fYear
2006
Firstpage
2222
Lastpage
2232
Abstract
It is now common to use knowledge about human auditory processing in the development of audio signal processors. Until recently, however, such systems were limited by their linearity. The auditory filter system is known to be level-dependent as evidenced by psychophysical data on masking, compression, and two-tone suppression. However, there were no analysis/synthesis schemes with nonlinear filterbanks. This paper describe 18300060s such a scheme based on the compressive gammachirp (cGC) auditory filter. It was developed to extend the gammatone filter concept to accommodate the changes in psychophysical filter shape that are observed to occur with changes in stimulus level in simultaneous, tone-in-noise masking. In models of simultaneous noise masking, the temporal dynamics of the filtering can be ignored. Analysis/synthesis systems, however, are intended for use with speech sounds where the glottal cycle can be long with respect to auditory time constants, and so they require specification of the temporal dynamics of auditory filter. In this paper, we describe a fast-acting level control circuit for the cGC filter and show how psychophysical data involving two-tone suppression and compression can be used to estimate the parameter values for this dynamic version of the cGC filter (referred to as the "dcGC" filter). One important advantage of analysis/synthesis systems with a dcGC filterbank is that they can inherit previously refined signal processing algorithms developed with conventional short-time Fourier transforms (STFTs) and linear filterbanks
Keywords
audio coding; channel bank filters; data compression; speech coding; speech synthesis; audio signal processors; compression; dynamic compressive gammachirp auditory filterbank; fast-acting level control circuit; human auditory processing; psychophysical data; psychophysical filter shape; tone-in-noise masking; two-tone suppression; Circuit synthesis; Control system synthesis; Filter bank; Humans; Nonlinear filters; Psychology; Signal processing; Signal synthesis; Speech analysis; Speech synthesis; Compression; nonlinear analysis/synthesis auditory filterbank; simultaneous masking; speech processing; two-tone suppression;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
ieee
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2006.874669
Filename
1709909
Link To Document