Title :
Auditory model inversion for sound separation
Author :
Slaney, Malcolm ; Naar, Daniel ; Lyon, Richard F.
Author_Institution :
Apple Comput. Inc., Cupertino, CA, USA
Abstract :
Techniques to recreate sounds from perceptual displays known as cochleagrams and correlograms are developed using a convex projection framework. Prior work on cochlear-model inversion is extended to account for rectification and gain adaptation. A prior technique for phase recovery in spectrogram inversion is combined with the synchronized overlap-and-add technique of speech rate modification, and is applied to inverting the short-time autocorrelation function representation in the auditory correlogram. Improved methods of initial phase estimation are explored. A range of computational cost options, with and without iteration, produce a range of quality levels from fair to near perfect
Keywords :
correlation methods; ear; filtering theory; hearing; iterative methods; phase estimation; speech enhancement; auditory correlogram; auditory model inversion; cochleagrams; cochlear model inversion; computational cost; convex projection; filter bank; gain adaptation; iteration; perceptual displays; phase estimation; phase recovery; quality levels; rectification; short-time autocorrelation function; sound separation; spectrogram inversion; speech rate modification; synchronized overlap-and-add technique; Acoustic noise; Auditory displays; Channel bank filters; Computer displays; Filter bank; Humans; Low pass filters; Microphone arrays; Spectrogram; Speech;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389714