Title :
Zero-Crossing Based Binaural Mask Estimation for Missing Data Speech Recognition
Author :
Kim, Young-Ik ; An, Sung Jun ; Kil, Rhee Man
Author_Institution :
Div. of Appl. Math., Korea Adv. Inst. of Sci. & Technol., Daejeon
Abstract :
This paper presents a new method of zero-crossing based binaural mask estimation for missing data speech recognition when multiple sound sources are present simultaneously. The mask is determined from the estimated directions of the sound sources, using spatial cues such as inter-aural time differences (ITDs) and inter-aural intensity differences (IIDs). In the suggested method, ITDs are estimated from the statistical properties of zero-crossings generated from binaural filter-bank outputs. We also consider the estimation of ITDs with the aid of IID samples to cope with the phase ambiguities of ITD samples at high frequencies. As a result, the proposed method provides an accurate estimate of sound source directions and a good masking scheme for speech recognition, with significantly lower computational complexity than cross-correlation based methods.
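The core idea of estimating an ITD from zero-crossings can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a single narrowband channel (a pure 500 Hz tone stands in for one filter-bank output), pairs each left-channel upward zero-crossing with the nearest right-channel crossing, and takes the median of the time differences as a simple stand-in for the statistical estimate described in the abstract. The function names, the `max_itd` limit, and the test signal are all hypothetical, and the IID-aided disambiguation for high frequencies is not modelled.

```python
import math
from statistics import median


def upward_zero_crossings(x, fs):
    """Return the times (in seconds) of upward zero-crossings of x.

    Each crossing is located by linear interpolation between the two
    samples that straddle zero.
    """
    times = []
    for n in range(len(x) - 1):
        a, b = x[n], x[n + 1]
        if a < 0.0 <= b:
            frac = -a / (b - a)  # fractional position of the crossing
            times.append((n + frac) / fs)
    return times


def estimate_itd(left, right, fs, max_itd=1e-3):
    """Estimate the ITD (right minus left, in seconds) for one channel.

    Assumption: the true ITD is smaller than half the signal period, so
    the nearest right-channel crossing is the matching one. Differences
    larger than max_itd (a hypothetical limit) are discarded.
    """
    t_left = upward_zero_crossings(left, fs)
    t_right = upward_zero_crossings(right, fs)
    if not t_right:
        return None
    diffs = []
    for t in t_left:
        d = min((u - t for u in t_right), key=abs)
        if abs(d) <= max_itd:
            diffs.append(d)
    return median(diffs) if diffs else None


# Synthetic example: the right channel lags the left by 0.3 ms.
fs = 16000          # sampling rate in Hz (assumed)
freq = 500.0        # tone frequency in Hz (assumed)
true_delay = 0.3e-3
n_samples = 800
left = [math.sin(2 * math.pi * freq * n / fs) for n in range(n_samples)]
right = [math.sin(2 * math.pi * freq * (n / fs - true_delay))
         for n in range(n_samples)]
itd = estimate_itd(left, right, fs)
```

Only comparisons and one interpolation per crossing are needed here, which hints at why a zero-crossing approach can be cheaper than evaluating a cross-correlation over many lags. At higher frequencies the half-period assumption fails, which is the phase ambiguity the abstract addresses with IID samples.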
Keywords :
channel bank filters; computational complexity; correlation methods; speech intelligibility; speech processing; speech recognition; statistical analysis; binaural filter-bank outputs; cross-correlation based methods; inter-aural intensity differences; inter-aural time differences; sound sources; statistical properties; zero-crossing based binaural mask estimation; Acoustic noise; Acoustic sensors; Auditory system; Data mining; Frequency estimation; Humans; Industrial training; Phase estimation; Working environment noise
Conference_Title :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1661219